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ABSTRACT 


This  report  is  a  study  chi  some  elementary  information  handling  properties 
of  neuromime  nets,  giving  most  emphasis  to  the  functioning  of  a  single 
neuromime  component,  and  containing  some  discussion  of  the  operation  of 
simple  nets.  Single  component  ccwputation  is  treated  from  the  point  of 
view  of  changes  brought  about  in  the  internal  structure  by  operations 
performed  during  data  flow.  A  geometrical  model  is  presented  which 
illustrates  the  pattern  measurement  behavior  of  the  component ,  and  some 
of  the  simpler  differential  equations  of  adaptation  are  solved  to  provia*» 
some  insight  into  the  effect  and  interaction  of  the  component  control 
parameters .  Simple  net  behavior  is  concerned  mainly  with  feedback  inter¬ 
action  among  components,  and  gives  some  useful  natation  for  describing  net 
operation . 
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Section  I 


INTRODUCTION 

This  report  is  a  description  of  the  computations  performed  by  a  type 
of  neural  net,  and  can  function  as  a  programming  manual  for  this  net. 

The  contract  under  which  this  report  was  prepared  is  a  part  of  an 
effort  to  develop  more  effective  information  handling  systems,  in  the 
sense  of  extracting  usable  knowledge  from  received  signals. 

More  generally,  the  overall  effort  is  concerned  with  the  development 
of  information  processing  systems  with  capabilities  approaching  those  of 
the  human  brain.  The  approach  being  taken  to  solve  this  problem  is  to 
analyze  the  behavior  of  the  brain,  breaking  it  down  into  functional  components, 
then  analyzing  each  component  until  a  basic  component  is  reached  which  can  be 
constructed  with  present  technology.  These  components  are  then  constructed 
(or  simulated)  and  put  together  in  various  systems,  and  the  behavior  of  these 
systems  compared  with  actual  cognitive  operations. 

Emphasis  has  been  placed  on  the  functioning  of  individual  components  and 
simple  nets.  Section  II  gives  a  basic  description  of  the  operations  performed 
by  components,  the  kinds  of  net  structures  the  component  is  designed  to  form, 
and  a  general  discussion  of  the  computational  parameters.  Section  III  describes 
informally  the  computations  performed  by  a  single  neuromime,  with  particular 
attention  to  adaptation.  Section  IV  is  a  more  rigorous  discussion  of  the 
computational  processes  of  a  component,  section  V  of  the  adaptation  process, 
and  section  VI  discusses  simple  nets.  Section  VII  considers  a  number  of 
extensions,  conjectures  and  unsolved  problems  that  have  arisen  in  connection 
with  this  research. 
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Section  II 


GENERAL  PROPERTIES  OF  NEUROMIME 
AND  NETS 


The  problem  under  consideration  is  that  of  building  a  flexible  pattern- 
recognizing  device  to  be  used  in  decoding  signals  produced  by  some  sort  of 
sensory  encoders.  Instead  of  producing  a  device  directly,  it  is  desired  to 
investigate  the  design  of  a  more  general  computa-tion  system  that  would  be 
self-organizing. 

This  report  is  an  analysis  of  some  of  the  mathematical  properties  of  a 
computation  system  developed  by  the  contract  monitor.*  The  system  is  modeled 
after  an  organizational  mechanism  shown  in  Figure  2.1.  There  are  a  series  of 
computation  areas  with  a  number  of  transmission  lines  connecting  them  with 
each  other,  and  with  the  outside,  both  input  and  output.  The  computer  would 
have  the  ability  to  alter  the  computation  parameters  of  an  area  in  such  a 
way  as  to  alter  its  responses  with  time  and  it  would  also  have  the  property 
that  some  areas  could  be  activated  and  others  de-activated  for  periods  of 
time . 


A  computational  element,  called  a  neuromime,  has  been  designed  for 
the  system  and  is  illustrated  in  Figure  2.2.  There  are  four  sets  of  10 
input  lines  each,  called  I  ,  I  ,  S  ,  and  S  •  Each  set  can  be  regarded  as 
a  10  dimensional  vector.  Since  all  transmission  to  and  from  the  element  is 
in  the  form  of  positive  numbers,  each  element  in  each  of  the  vectors  is  positive. 

Associated  with  each  input  vector  is  a  weighting  vector,  respectively 
-* 

Gj+,  Gj_,  Gg+,  and  Gg_.  The  output  R  is  computed  as  follows:  [B  is  a  bias  — 
used  as  the  exciting  or  inhibiting  mechanism,  M^  and  M^  are  weighting 
factors ] 


I  =  B  +  Mj(I+  •  G  +  -  I~  •  G  ) 

S  =  Mg(S+  •  Gg+  -  S“  •  Gg_)  (2.1) 


R  =  max(I  +  S,  0) 

^See  Neuron  Component  Development,  Semi-Annual  Report,  Contract  AF33(616 )-6805 , 
General  Electric  Co.,  15  June  1960;  Gracer  and  Orr,  Neuromime  Network  Simula¬ 
tion,  Final  Report,  Contract  AF33(657)-11194,  Service  Bureau  Corp.,  14  August 
1964;  Neural  Network  Simulator,  Final  Engineering  Report,  Contract  AF33(657)- 
8489,  Teledyne  Systems  Co.,  January  1965. 
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Sensory  Inputs 


Outputs 


Figure  2.1.  Computational  Structure  for  Self -or: anizin T  Computer 
Showing  Transmission  Lines  and  Computational  Areas 
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Logical  Diagram  of  the*  Steele  Neuromime 
Figure  2.2 


After  the  computation  of  R  ,  the  G's  are  altered  to  make  them  respond 
better  to  the  input  signals.  CFj.  Fg  are  weighting  factors,  A  is  a  vector 
with  the  same  nunber  of  elements  as  A,  each  equal  to  the  average  of  the 
elements  of  X.D 


GI+  =ri*R*(I+“I+)+  Gj+ 

GI“  =  "Fi  ‘  R  ‘  (I"  -  +  Gi“ 

Gg+  =  Fg  *  R  •  (S+  -  s+)  +  Gs+ 
Gg.  =  -Fg  •  R  *  (S*  -  §-  )  +  Gg_ 


(2.2) 


subject  to  sane  restraint  on  the  final  value  of  each  G  vector.  One 
restraint  considered  is  that  the  sum  of  the  elements  in  each  G  vector 
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should  remain  the  same,  another  is  that  the  sum  of  squares  of  the  elements 
in  each  G  vector  should  be  constant, 

A  computational  area  in  the  computer  is  composed  of  a  number  of  these 
elements.  Inputs  to  the  area  from  other  areas  or  from  the  outside  are 
connected  to  I  inputs  of  the  neuromimes.  The  S  inputs  are  connected 
to  the  outputs  of  neuromimes  in  the  same  area  to  give  feedback  to  the  system. 
The  inputs  B,  F  and  M  can  be  from  anywhere  in  the  system. 

The  overall  combination  of  computational  areas  is  called  a  net ,  and  a 
single  area  is  called  a  level.  The  organization  of  a  level  is  illustrated 
in  Figure  2.3.  The  I  inputs  to  each  component  are  taken  from  external 
sources,  while  the  outputs  of  some  of  the  neuromimes  are  connected  to  the 
S  inputs  of  others.  All  S  inputs  in  the  level  are  taken  from  outputs 
of  other  components  in  the  same  level.  The  external  sources  will  be  termed 
the  Input  Space. 

Parameters  affecting  the  computations  of  the  level  are  M^,  M^,  F^, 

Fs>  and  B.  F^  and  F^  are  adaptation  weights  which  have  no  immediate 
effect  on  a  particular  computation,  while  B  is  a  bias  acting  on  each 
component  of  the  level.  If  M  =  0,  then  each  of  the  components  performs 

O 

a  measurement  on  the  Input  Space  independent  of  all  other  measurements  in 
the  level,  except,  of  course,  of  possible  overlap  and  statistical  correla¬ 
tion  between  the  inputs  to  the  different  components.  When  is  nonzero 

a  degree  of  interaction  is  introduced. 

These  parameters  are  designed  to  yield  the  following  behavior  from  a 
level: 

1.  The  output  of  a  particular  component  can  be  either  increased  or 
decreased  by  the  output  of  a  neighboring  component. 

2.  The  degree  of  interaction  can  be  controlled  on  a  global  basis; 
that  is,  the  general  degree  of  dependence  of  the  outputs  of  components  on 
the  outputs  of  other  components  can  be  controlled. 

3.  The  degree  of  dependence  of  the  output  of  a  particular  component 
on  the  outputs  of  its  neighbors  can  be  controlled  on  a  local  basis. 

4.  The  overall  activity  of  the  net  can  be  controlled. 
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Inputs  to  level^+^ 


Figure  2.3.  Example  of  Neuromime  Net 
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Figure  2.4  shows  how  the  first  consideration  is  effected.  The 
component  C  has  as  an  S+  input  the  output  from  component  Nj ,  and 

as  an  S  input  the  output  from  component  N^.  A  large  output  from  N1 

will  raise  the  output  of  C,  and  a  large  output  from  will  lower  the 

output  of  C.  The  degree  of  interaction  is  given  by  the  ratio  of  M  to 
Mg.  If  M^/Mg  >:>  there  will  be  little  interaction,  if  M^/Mg  1, 

there  will  be  a  great  deal  of  interaction. 

Mj  and  Mg  are  parameters  common  to  each  component  on  a  level.  An 
adjustment  in  their  ratio  therefore  increases  or  decreases  the  interaction 
generally. 

It  is  necessary  to  make  use  of  a  special  kind  of  interconnection  to 
obtain  the  third  consideration.  In  Figure  2.5,  we  see  that  the  output  of 

component  N  is  connected  to  both  the  ST  and  S  inputs  of  component  C. 

The  degree  of  interaction  is  thus  dependent  on  the  Gg  set  of  C.  The 
effect  of  N  on  C  is 


V*1 


(2.3) 


+  .  *>+  -  .  ■>- 

where  g.  is  an  element  of  Gc,  g.  is  an  element  of  Gc,  and 

1  +  b  3  b 

is  connected  to  the  ith  S  input  and  to  the  jth  S  input. 

Initially,  gt  =  g.,  so  that  there  is  no  interaction.  If  F 
is  nonzero,  however,  the  Gg  sets  will  be  altered  in  the  same  manner 
as  the  Gt  sets  are  altered  with  F^.  A  positive  Fg  will  increase 
gt  and  decrease  g.,  making  the  interaction  of  N  and  C  positive, 
and  a  negative  Fg  will  have  the  reverse  effect,  thus  altering  the  local 
degree  of  interaction. 

The  bias,  B,  which  can  be  either  positive  or  negative,  is  a  device 
to  adjust  the  activity  threshold  of  each  component  in  the  level.  If  this 
bias  is  low,  the  level  will  tend  to  be  "aware"  of  only  those  inputs  for 
which  it  is  set  up  to  give  a  maximal  response.  Conversely,  if  the  bias 
is  high,  the  activity  will  be  increased,  and  the  level  will  tend  to  be 
"aware"  of  inputs  which  it  formerly  ignored  (because  R  <  0) 

For  further  discussion  of  the  action  of  a  level,  it  is  useful  to 
have  in  mind  the  mathematical  formulations  and  geometric  models  developed 
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in  the  following  chapters.  Host  of  the  work  has  been  for  the  special  case 
of  constant  inputs  to  a  level,  although  in  the  last  chapter  some  preliminary 
formulations  are  given  for  time  varying  inputs. 
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Figure  2.4.  Sample  Net  showing  Positive 
and  Negative  Interaction 


Fi  jure  2.5 


Example  of  Interaction  Alterable  to  either 
Positive  or  Negative  by  G^-set  Adaptation 
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Section  III 


COMPUTATIONS  PERFORMED  BY  A  SINGLE  NEUROMIME 


A  neuromime  can  be  considered  as  an  adaptive  measuring  device  on  an 
input  space.  Its  input  consists  of  2n  lines,  n  of  which  are  attacned 
to  the  I+  inputs,  and  n  to  the  I  inputs.  These  lines  can  be  con¬ 
sidered  as  a  2n-dimensional  time  varying  vector  function  x(t),  whose 
first  n  components  are  inputs  to  I+  and  the  last  n  to  I  .  The  com¬ 
ponents  of  x  will  be  denoted  as  x  x  ,  x  . x  .  For  tne 

present,  we  will  consider  x(t)  to  be  constant  over  a  period  t ^ ,  cnanging 
as  step  functions.  A  typical  component  might  have  values  as  in  Figure  3.1. 
The  values  of  the  components  are  restricted  to  be  between  0  and  1. 


Typical  Sequence  of  Values  for  a  Single 
Input  to  a  Neuromime 

Figure  3.1 

Within  the  neuromime  are  stored  a  set  of  weights  corresponding  to 
the  inputs.  These  weights  may  be  denoted  by  the  vector  elements 
gl,,',,Sn*  gn+l,’’’,g2n  *  Since  in  Equation  (2.1)  the  last  n  g-elements 
are  subtracted,  it  makes  sense  to  consider  in  the  following  the  vector  g 
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g(t)  = 


(3.1) 


For  a  beginning  in 
of  the  neuromime,  with 
by  (2.1)  reduce  to 


the  discussion  we  will  consider  the  simplest  case 
Mg  =  0,  B  =  0  and  M ^  =  1.  The  computations  given 


-*■  -+ 

R  =  x  «  g 

R*  =  max(R,0) 


(3.2) 


The  restriction  on  the  g  elements  considered  in  this  report  is  that 


2n 


l  i,  ■  i 


i=nti 


(3.3) 


There  is  a  further  restriction  on  each  g-element 

0  <  gi  <  ^  i  =  1,... ,2n  (3.4) 

for  a  given  integer  p.  This  latter  restriction  means  that  a  g-element 
can  saturate  so  that  it  cannot  be  increased.  Its  effect  may  be  seen  by 
considering  the  adaptation  process  in  Figure  3.2 

Figure  3.2  gives  an  example  of  the  adaptation  for  a  particular  input 
on  the  excitatory  inputs  only.  Part  a)  shows  the  input  elements.  Part  b) 
shows  the  deviation  vector  of  Equation  (2.2) 


which  is  used  to  guide  the  adaptation.  Part  c)  shows  the  g-set  before 

adaptation.  Note  that  g  and  g  are  saturated.  Part  d)  shows  the 
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a)  Input 


b)  Deviation 
x  -  x 


c)  g-set  before  adaptation 


+  + 


d)  g-set  after  adaptation 
g  elements  marked  ! 
have  saturated 


gl  g2  g3  g4  g5  g6  g7  g8  g9  g10 


e)  Terminal  g-set 
which  would  be 
unchanged  by  this 
adaptation 


Figure  3.2.  Example  of  Excitatory  Adaptation  with  =  .1 
and  Showing  Saturation 
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g-set  after  adaptation,  with  an  indication  of  the  g-elements  which  were 
unable  to  adapt  by  the  full  amount  because  of  saturation.  There  is  a 
mechanism  in  the  simulator  which  adjusts  for  this  occurence.  Part  e) 
shows  the  g-set  as  it  would  be  if  this  input  were  repeated  sufficiently. 

Using  the  deviation  vector  rather  than  the  input  vector  in  this 
adaptation  is  justified  by  the  normalization  criterion  that  the  sum  of  the 
g-elements  remain  constant.  The  sum  of  the  d-elements  is  zero,  so  that 


n 

l  (Si 

i=l 


w 


n 


n 


l  Si  +  F  R  l  d 
i=l  i=l 


n 


(3.6) 


Note  that  in  Equation  (2.2),  the  inhibitory  deviation  'Vector  is  subtracted 
from  the  inhibitory  g  vector. 

The  terminal  g-set  is  of  interest  because  it  in  a  certain  sense  picks 
out  those  inputs  of  most  significance  and  ignores  the  rest.  On  the  excitatory 
side  it  picks  out  the  p  largest  inputs  (those  p  which  have  the  least 
negative  deviation),  and  on  the  inhibitory  side  it  picks  out  the  p  smallest 
inputs  (those  which  have  the  least  positive  deviation).  The  component  will 
give  an  output  only  if  most  of  the  significant  excitatory  inputs  are  present 
and  most  of  the  inhibitory  inputs  are  absent.  Figure  3.3  shows  three  devia¬ 
tion  vectors  which  result  in  the  same  terminal  g-set. 

In  practice,  the  g-sets  will  rarely  achieve  their  terminal  state,  but 
will  fluctuate  about  some  intermediate  configuration.  In  the  operation  of 
the  net,  the  first  input  will  generally  find  a  neuromime  for  which  it  gives 
a  high  response.  This  neuromime  (neuromime  A)  will  adapt  its  g-set  more 
toward  this  input  than  will  any  other,  even  though  many  neuromimes  will 
produce  an  output  and  will  adapt  to  a  degree.  For  subsequent  inputs  which 
differ  from  the  first,  neuromime  A  will  generally  give  a  small  output  and 
will  adapt  toward  this  new  input  relatively  little,  while  others  will  adapt 
more.  If  the  first  input  recurs  reasonably  soon,  neuromime  A  will  have  an 
even  larger  output  relative  to  the  others  than  before,  and  will  continue 
this  differential  adaptation.  The  first  input,  however,  will  probably  not 
recur  often  enough  to  drive  neuromime  A  to  a  terminal  state. 
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Deviation 

Vectors 


a.  5  elements  positive 
5  negative 


.6 


0 


-.6 


b.  4  positive 
-  6  negative 


4  negative 


d.  Common  Terminal 
G-set 


Three  Deviation  Vectors  Resulting  in  the  Same 
Terminal  G-set 


Figure  3.3 
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Section  IV 


GEOMETRICAL  MODEL  OF  COMPUTATIONS 
PERFORMED  BY  A  SINGLE  NEUROMIME 


Because  of  the  vector  nature  of  the  operations  in  the  neuromime,  it 
makes  sense  to  look  upon  them  in  a  geometrical  model.  The  adaptation 
restrictions  given  by  equations  (3.3)  and  (3.4)  indicate  that  this  model 
is  related  to  a  higher  dimensional  octahedron.  A  discussion  of  the 
formulation  of  this  concept  and  an  introduction  to  the  notation  and  lan¬ 
guage  of  this  model  is  given  in  Appendix  I.  The  following  discussion 
assumes  that  the  reader  is  familiar  with  this  material. 

Since  the  terminal  state  of  a  neuromime  is  relatively  simple  compu¬ 
tationally,  we  will  develop  the  model  from  it.  We  will  assume  that  the 
normalization  is  octahedral,  as  in  equations  (3.3),  (3.4)  and  will  con¬ 
sider  only  the  I+,  I  inputs  by  setting  M  =  B  =  0.  We  will  speak 
of  the  G-set  as  divided  into  two  vectors  G+  and  G  with  elements  (g^l 

and  {g^},  respectively.  Each  of  the  vectors  I+,  I  ,  Gr ,  G  will  hav? 
n  elements. 

The  condition  that  the  G-set  is  terminal  means  that  each  gt*  g^  equals 

either  0  or  meaning  that  there  are  only  p  nonzero  elements  in  each 

+  —  P  4-  +  _  — 

of  G  ,  G  ,  denoted  by  {g^  , . . .  ,g^  }  and  {g,.  ,  ...,g.  }  respectively.  Tne 

computation  of  the  neuromime  from  ^(2.1)  reduces  to  * 


I  = 


f  gt  X  -  £  g+  X 

fc=l  Z  Z  Z=1  JZ  Jl 


=  M, 


f 


x . 

3. 


tSi  p 


4=1 


(4.1) 


U  -  £ 

£=1  l  £=1  J£ 


We  can  further  assume  that  the  sets  {i^}  and  {j^}  have  no 
numbers  in  common,  so  that  we  can  consider  the  sum  above  to  be 
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I 


(4.2) 


M1 


L  x* 
£=1  Xl 


f  *. 

£=ptl  Xi 


Since  the  output  R  is  computed  by 

R  =  max(I ,  0) , 


we  can  say  that 


R  >  0 


if  and  only  if 


A=1 


S,=ptl 


(4.3) 


The  Input  Space  is  a  2n-dimensional  Euclidean  space,  restricted  to  the 
principal  section,  and  the  points  for  which  R  >  0  are  divided  from  those 
where  R  =  0  by  the  hyperplane 


x.  + 

l 


1 


+ 


x.  + 

Vi 


+  X  . 
1 


2p 


(4.4) 


which  is  a  division  of  the  principal  section  into  two  regions.  If  we 
restrict  the  inputs  so  that 


K  .x 

£=1  £ 


(4.5) 


we  find  that  we  have  restricted  the  inputs  to  lie  in  the  interior  and 

surface  of  2n-dimensional  octahedron.  Furthermore,  we  can  restrict  our 

attention  to  that  2p- dimensional  octahedron,  Q,  determined  by  the  vertices 

v.  ,  which  is  also  divided  by  the  (2p-l)-dimensional  hyperplane, 

1 1 


or 


+  x. 

l 


=  X. 
2 


+  X. 


P  +  1 


2p 


(4.6) 


x.  +  •••  +  x.  -  x.  -  •••  -  x.  =  0. 
Xl  Xp  Xp+1  X2p 


A  the  region  where  R  >  0,  and  by  A  the  region 


We  will  denote  by 
where  R  =  0. 
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In  the  region  A,  R  considered  as  a  function 
R:  I+  x  i"  x  G+  x  G~  ->•  (0,  1] 

defines  a  set  of  equivalence  classes,  one  for  each  possible  value  of  R. 
Geometrically,  this  equivalence  class  is  a  (2p-l)- dimensional  hyperplane 

x.  +  •••  +  x.  -  x.  -  •••  -  x.  =  y  for  0  <  y  *  1.  (4.7) 

1  p  p+1  12p 


Hy  is  parallel  to  HQ  . 

An  example  of  this  for  the  4-dimensional  case  is  given  in  Figure  4.1. 
The  figure  shows  the  intersection  of  the  3-diraensional  hyperplanes  Hq  and 
with  the  face  of  the  octahedron.  The  limiting  case  is  the  2-dimensional 
hyperplane  xj  +  x2  =  1,  X3  =  X4  =  0,  which  is  a  line  in  the  xjx2  coordi¬ 
nate  plane.  This  is  shown  as 

For  the  2p-dimensional  case , 


Hj  in  the  figure. 


is  given  by  the  equations 


x.  + 


+  x.  = 
1 

P 


It 


(4.8) 


and  is  thus  a  (p-l)-dimensional  hyperplane,  a  degenerate  member  of  the 
(2p-l)-dimensional  family  H^.  As  such,  it  is  also  parallel  to  H0  • 

We  will  denote  the  vector  (x.  ,...,x.  )  as  £*.  We  can  thus  interpret 

1  2p  ->• 

R  as  a  "distance"  measure  of  the  hyperplane  containing  the  input  point  x* 
from  a  reference  hyperplane  H  .  R  will  be  1  when  x*  e  H1  and  0  when 
x*  e  A.  A  response  R  =  1  does  not  identify  the  input  as  a  particular  point, 
but  restricts  it  to  a  (p-l)-dimensional  hyperplane.  In  the  example  given  in 
Figure  4.1,  points  (1,  0,  0,  0)  and  (0,  1,  0,  0)  are  both  contained  in  Hi  . 
An  interval  0<a<R<b<l  ((interval  (a,  b))  defines  a  neighborhood  in 
the  space  Q  as  the  2p-dimensional  volume  bounded  by  the  (2p-l)-dimensional 
hyperplanes  and  H^.  This  is  a  kind  of  "slice"  of  the  octahedron  Q,  so 

that  two  points  can  be  in  the  same  neighborhood  even  though  their  Euclidean 
distance  from  each  other  might  be  very  large,  and  can  be  in  different  intervals 
even  though  it  might  be  small. 
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<7*0,0  i)  (i,0, 0  i) 


Figure  4,1  Four-dimensional  Case--Intcrsection  with  Face  of 


x1+x2-x^-x^=y  showing 
y  =  o,  y  =  7*  y  =  i 


H 

y 


for 
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The  more  complicated  non-terminal  case  can  now  be  treated.  Equation 
(2.1)  now  reduces  only  to: 


n 


/  g.x.  -  /  g.x. 

i=l  i=n+l 


2n 

l 


Setting  R  =  R'/Mj.,  we  have 


R  =  g,*. 


+  g  X 

n  n 


gn+lXn+l 


—  o  o  c 


g2nX2n 


(4.9) 


In  addition,  g  is  altered  after  the  computation  as  follows: 
g  =  T  •  R  •  d  +  g  (4.10) 

where 


and  F  is  a  constant  parameter.  For  the  present,  we  can  take  F  =  1. 

An  example  of  this  computation  iterated  until  the  G-sets  become  terminal 
is  given  in  Table  4.1  for  n  =  2  and  p  =  1. 
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If  both  the  g.  and  x.  are  restricted  as  Before  to  the  interval 
&i  1 

[0,  1]  equation  (4.9)  for  R  =  0  determines  a  (2n-l)-dimensional  hyper¬ 
plane,  H  ,  dividing  the  principal  section  of  a  2n-dimensional  octahedron. 
This  hyperplane  always  contains  the  line  x^  =  =  ***  =  x2n  since 


+  gn  gn+l  + 


+  g2n  1 


by  the  octahedral  normalization;  and  for 


X1  =  —  =  x2n  =  a* 


gxa  + 


*na  -  W 


a.  —  « ©  © 


'  g2na 


=  a(gj 


gn  "  gn+l 


—  *  •  * 


"  g2n* 


=  a(l  -  1)  =  0,  (4.11) 

for  any  a.  The  intersection  of  H  with  the  order  face  therefore 

contains  the  point  x.  =  •  •  •  =  x0  =  .  The  line  x.  =  x^  =  •••  =  x0 

i  2n  2n  12  2n 

will  be  denoted  by  L. 

The  orientation  of  H  can  be  determined  by  considering  the  inter¬ 
sections  of  H  with  the  2n<^  order  faces  (edges)  of  the  octahedron.  An 
edge  (Vp  v.)  is  described  by  the  set  of  2n  -  1  equations 


x.  +  x.  =  1 
i  3 

x^  =  0  for  l  =  1,  2,...,2n 
£  /  i  or  j 


The  intersection  of  H  with  (v.  ,  v.) 
addition  of  one  of  the  equations 


is  the  point  resulting  from  the 


giXi  +  gjgj  =  °  (4.13) 

giXi  "  gjXj  =  0  (4.14) 

to  the  set  describing  (v^,  v j ) ,  making  2n  equations  in  all.  Equation  (4.13) 

appears  if  i  and  j  are  both  either  greater  than  n  or  less  than  n  +  1; 
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Table  4.1 


*Indi cates 


Sample  Iteration  of  a  Four- dimensional  G-set 


X1 

i+ 

x2 

.4 

.6 

++ 

d 

D 

1 

2 

-.1 

.i 

Iteration 

G+ 

gl 

g2 

1 

.5 

.5 

2 

.49 

.51 

3 

.48 

.52 

4 

.47 

.53 

5 

.45 

.55 

6 

.43 

.57 

7 

.42 

.58 

8 

.40 

.60 

9 

.38 

.62 

10 

.36 

.64 

11 

.34 

.66 

12 

.31 

.69 

13 

• 

ro 

00 

.72 

14 

.25 

.75 

15 

.21 

.79 

• 

o 

• 

final 

0.0 

1.0 

I 


X3 

x4 

.6 

.2 

d4 

.2 

-.2 

G~ 

g3 

g4 

R 

.5 

.5 

.1* 

00 

It 

• 

.52 

.11 

.46 

.54 

.12 

.44 

.56 

.13 

.41 

.59 

.15 

.  38 

.62 

.16 

.35 

.65 

.18* 

.31 

.69 

.20 

.27 

.73 

.22 

.23 

.77 

.24 

.18 

.82 

tO 

CM 

• 

.13 

.87 

.29 

.07 

.93 

.32* 

.01 

.99 

.35 

o 

• 

o 

1.0 

.36 

o 

• 

o 

1.0 

.40* 

points  taken  for  example  in  Figure  4.3 
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Equation  (4.14)  appears  otherwise.  The  former  corresponds  to  the  cases 
where  x^  and  x^.  are  both  in  I+  or  i",  and  the  latter  to  the  cases 
where  x.  is  in  I+  and  x.  in  i"  or  x.  is  in  I~  and  x.  is  in 
i+.  31 

The  solution  to  (4.12)  t  (4.13)  is,  for  g.  >  0,  g.  >  0 

1  ’  °i 


x. 

l 


* 


x. 

3 


(4.15) 


This  solution  lies  outside  the  principal  section  of  the  octahedron,  since 
if  x^  is  positive,  must  be  negative,  and  if  g^  -  g^  is  negative, 

Xj  will  be  negative.  Similarly,  if  g^  -  g^  is  positive,  x^  will  be 
negative. 


H  and  (v^,  v. )  can  intersect  only  when  g^  =  0,  or  g_.  =  0,  or  both. 
When  g^  =  0,  the  solution  is  x^  =  1,  x.  =  0.  When  g^  =  0,  the  solution 
is  Xj  =  1,  =  0.  These  points  correspond  respectively  to  and  v  . 

When  g^  =  g.  =  0,  every  point  in  (v^,  )  is  a  solution.  Geometrically, 

the  first  case  occurs  when  H  passes  through  the  vertex  v^,  the  second 
occurs  when  H  passes  through  v ^ ,  and  the  third  when  all  of  (v^,  )  lies 

in  H. 


The  solution  to  (4.12)  t  (4.14)  is 


x.  =  — 
i  g 


i 


t  g  . 
i  ^3 


x .  =  — 
3  g 


+  g . 

i  *3 


(4.16) 


which  clearly  lies  in  the  principal  section.  The  point  (x^,  xn)  will 

lie  closer  to  the  vertex  corresponding  to  the  lesser  of  g^,  g^f  and  will 

be  at  the  midpoint  of  the  edge  (v. ,  v.)  when  g.  =  g.. 

1  3  1  3 

These  solutions  can  be  characterized  by  saying  that  H  separates  I 
vertices  from  I  vertices.  In  other  words,  if  the  input  x  lies  on  the 
side  of  the  I~,  there  is  no  response,  and  if  x  lies  on  the  side  of  the 
I+ ,  there  is  a  response.  We  will  cpeak  of  H  dividing  the  principal 
section  into  a  response  region  and  a  non- response  region. 
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An  example  is  given  in  Figure  4.2  for  the  4-dimensional  case,  as  seen 
in  the  4**1  order  face.  H  intersects  the  face  in  a  plane  H'. 

I+  =  (xx,  x2),  i"  =  (x3,  x4),  gl  =  .5,  g2  =  .5,  g3  =  .7,  g4  =  .3. 

Note  that  H  intersects  the  edges  (Vj ,  v3>,  (v1 ,  v4),  (v2,  v3)  and 
(v2,  v4);  the  edge  (v1 ,  v^)  lies  on  the  response  side  of  H  and  the  edge 
(v3»v4)  lies  on  the  nonresponse  side  of  H. 

The  alteration  of  the  3-sets  results  in  a  rotation  of  H  about  the 
line  L.  In  the  face  this  becomes  a  rotation  about  the  center  point.  This 
alteration  can  proceed  until  the  terminal  state.  Recall  that  the  terminal 
state  for  the  g-sets  has  each  g^  equal  to  0  or  —  .  Taking  every  combi¬ 
nation  of  g.  and  g.,  we  note  the  only  possible  combination  of  values  are 
1  ] 

that  both  gj,  and  g^  are  zero,  that  both  are  equal  to  —  ,  and  that  one 
is  equal  to  zero  and  the  other  equal  to  ^  .  In  the  first  instance,  (v^,  v^. ) 
lies  entirely  in  H  ,  in  the  second,  H  intersects  (v^,  v  j )  at  its  midpoint, 
and  in  the  last,  H  intersects  (v^,,  v^)  at  the  vertex  corresponding  to  the 
g-element  equal  to  zero. 

An  example  of  the  rotation  process  for  n  =  2  is  given  in  Figure  4.3. 

The  sequence  of  numbers  is  that  given  in  Table  4.1.  Note  that  the  angle 
between  H  and  (v2,  v4)  tends  toward  90°  and  the  angle  between  H  and 
(vl,  v  )  tends  toward  0°  . 

The  angle  between  H  and  an  edge  (v^,,  v.)  is  of  interest  since  it 
tells  how  much  the  preponderance  of  x.  over  x.  affects  the  output,  which 
is  related  to  the  distance  of  x  from  H  .  As  the  projection  of  x  on  the 
x^x.  coordinate  plane  moves  along  the  edge  (v. ,  v.),  the  output  will  change 
as  the  cosine  of  the  angle  between  x  and  g  ,  the  normal  to  H,  as  in 
equation  (2.2).  If  the  angle  between  (v. ,  v.)  with  H  becomes  small,  any 
movement  of  x  primarily  in  the  x^x^-coordinate  plane  will  have  little  effect 
on  the  output,  since  this  movement  is  perpendicular  to  g  .  In  other  words  it 
does  not  matter  what  value  a  particular  x^  takes  if  the  corresponding  value 
of  gj.  is  near  zero. 


Since  the  parameter  p  is  the  same  for  both  G  and  G 


there  will 


always  be  the  same  number  of  nonzero  elements  in  both  sets.  The  terminal  case 
will,  there 
of  zero  g. 


will,  therefore,  always  have  an  even  number  of  nonzero  g^  and  an  even  number 


The  set  of  (v.,  v.)  such  that  both  g.  and  g.  are  zero 
i*  3  ^1^3 

determines  a  simplex  of  2(n  -  p)  dimensions.  This  simplex  has  as  vertices 
all  those  coordinates  which  contribute  nothing  to  the  input.  These  vertices 
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VI 


Figure  4.2 

Four-dimensional  Case  for  gj  =  .5,  g2  =  .5,  g3  =  .7,  g4  =  .3 
x  indicates  midpoint  of  edge . 
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Fi  upj  4.  3  Various  Stages  in  the  Rotation  of  H'  for  Four  Dimensions 


Data  taken  from  Table  1. 

Arrows  indicate  direction  of  movement  of  intersections  with  edges. 
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can  therefore  be  removed  from  the  model,  as  in  the  discussion  of  the 
terminal  state  model  in  Equation  (4.1)  and  following.  We  are  then  left 
with  a  2p^  order  simplex.  In  the  example  of  Figure  4.3,  the  terminal 
simplex  is  the  2n<*  order  (v^,  vip  • 

As  discussed  in  Section  HI*  the  terminal  simolex  eventually  reached  is 

given  by  the  p  largest  (smallest)  elements  of  I+  and  the  p  smallest 

(largest)  elements  of  I  ,  depending  on  whether  F  is  positive  or  negative. 

An  example  of  inhibiting  adaptation  is  given  in  Table  4.2.  Note  that  the 

adaptation  becomes  slower  as  R  decreases. 

Since  the  terminal  state  is  given  by  the  indices  of  the  p  largest 

elements  in  D+  and  the  p  smallest  elements  in  D  ,  a  large  class  of 

inputs  will  give  rise  to  the  same  terminal  state.  This  class  is  given  by 

the  region  in  the  principal  section  determined  by  the  set  of  (2n  -  1)- 

dimensional  hyperplanes  described  in  detail  in  Appendix  I.  Each  primary 

region  in  Appendix  I  corresponds  to  one  of  the  possible  total  orderings  of 

(x^, . . . »x2n}.  One  of  these  primary  regions  can  be  characterized  by  the 

ordering  x-  >•••>  X;  .  The  class  of  inputs  which  gives  rise  to  the 
1  2n  ,  .  . 

terminal  simplex  given  by  (v*  ,...,v*  ,  v*  ,...,v:  )  is  a  union  of 

■*■1  p  p+1  2p 

primary  regions.  Any  ordering  is  included  in  the  union  in  which  all  x^  in 

(x4 ,,..., Xi  }  are  greater  than  any  other  x.  in  (x. ,...,x_};  and  all  x. 
**■1  a-p  i  i  i 

in  {x£  }  are  less  than  any  other  x.  in  {*n+i » • • • »x2n^‘ 

pt  1  iiK  1 

An  example  will  clarify  this  concept.  In  the  Figure  5  of  Appendix  I, 
a  4-dimensional  case,  there  are  24  primary  regions  characterized  by  the 
orderings 


Xi  >  x2  >  x3  >  x4 
X1  >  x2  >  X4  >  x3 
X1  >  x3  >  x2  >  x4 
X1  >  x3  >  x4  >  x2 


x4  >  x3  >  x2  >  X1 


The  terminal  simplex  is  (v2,  v4),  so  that  the  regions: 
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Table  4.2 


Inhibiting  Rotation  of  Example  of  Table  4.1  (F  =  -1) 


x 


1 

4 


!+ 


.6 


x 


4 


.2 


d 


l 


-.1 


.2 


D 


d 


4 

2 


G  G  R 


Iteration 

gl 

g2 

g3 

g4 

i 

.5 

.5 

.5 

.5 

.1 

2 

.51 

.49 

.52 

.48 

.09 

3 

.52 

.48 

.54 

.46 

.08 

4 

.53 

.47 

.55 

.45 

.07 

5 

.53 

.47 

.56 

.44 

.07 

6 

.54 

.46 

.58 

.42 

.06 

7 

.54 

.46 

.59 

.41 

.06 

8 

.55 

.45 

.60 

.40 

.05 

9 

.55 

.45 

.61 

.39 

.05 

10 

.56 

.44 

.62 

.38 

.04 

11 

.56 

.44 

.63 

.37 

.036 

12 

.556 

.444 

.637 

.36  3 

.034 

13 

.553 

.447 

.644 

.356 

.032 
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>  Xi  >  x3  >  x4 

>  X3  >  X1  '  X4 

>  x3  >  x4  >  X1 


X 

X 

X 


3 

3 

3 


"  X2  >  X1  >  X4 

>  x2  >  x4  >  xi 

>  X4  >  X2  >  X1 


(4.18) 


together  contain  all  inputs  which  give  rise  to  the  terminal  simplex  of 
Figure  4.3. 

When  there  is  a  sequence  of  different  inputs  to  the  neuromime,  the 
rotation  will  be  in  the  same  direction  for  each  one  provided  that  the  inputs 
all  lie  in  the  same  region.  If  some  of  the  inputs  lie  in  a  different  region, 
there  will  be  competition  between  the  tvjo  rotations,  and  the  attainment  of  the 
terminal  state  will  be  delayed.  The  state  which  will  predominate  is  not 
immediately  clear,  since  the  order  of  inputs  in  the  sequence  is  quite  signif¬ 
icant. 

These  unions  of  primary  regions  are  the  sets  of  inputs  which  the 
neuromime  considers  similar  in  a  certain  sense,  so  that  they  indicate  which 
classes  of  inputs  may  most  easily  be  identified  by  the  neuromime.  As  seen 
above,  an  attempt  to  force  the  neuromime  to  recognize  a  set  of  inputs  lying 
in  different  regions  will  result  in  varying  results  and  slower  adaptation. 

At  this  point,  it  is  useful  to  make  a  final  examination  of  the  vector 
aspects  of  the  computation  of  R  ,  especially  the  algebraic  aspects.  The 
definitions  of  the  vectors  involved  are  recapitulated  below: 
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X  = 


X 

\  n 


d  = 


we  will  need  the  identities 


m  = 


g  = 


r\ 

■ 

Sn 


-g, 


ntl 


i  / 

Vg2n  / 


(4.19) 


-b 

X  = 

-b 

d  + 

-b 

m 

(4.20) 

-b 

X  0 

II 

t  X> 

(d  + 

-b 

m)  • 

II 

f  X> 

-b  ~b  ~b 

|  d| 2  +  d  •  m 

(4.21) 

-b 

d  ° 

-b 

m  = 

X 

n 

I  d+ 

i=l  1 

t  x- 

n 

T  d7  =  0 

i=l  1 

(4.22) 

that 

-b 

x  • 

-b 

d  = 

Idl2 

-b 

(4.23) 

The  space  of  possible  x  vectors  is  a  vector  space  of  2n  dimensions, 

~b  ~b  ~b 

denoted  by  X  .  Each  x  e  X.  can  be  written  as  d  +  m,  and,  from  (4.22) 

-b  -b  -b 

d  •  m  =  0  .  The  components  of  each  vector  m  satisfy  the  constraints 


m. 


m  »  m 

n*  n+1 


m 


2n 


(4.24) 
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The  vectors  m  therefore  lie  in  a  space  M_  of  two  dimensions.  M 

2  2 

contains  the  zero  vector.  Since  each  component  of  each  d  satisfies  the 
constraints 


dl  +  "•  -1-  dn  =  dn+l  +  •"  f  d2n  =  0  •  (4-25) 

-> 

The  only  d  that  is  contained  in  NL  is  the  zero  vector.  can  therefore 

J  2  2n 

be  decomposed  into 

X2„  =  M2©D2n-2  >  <4'26> 

-> 

with  every  d  contained  in  D^n  ^  »  where  (?)  denotes  an  orthogonal  direct 
sum  as  in  Hoffman  and  Kunze  (1961). 

Considered  as  a  hyperplane  in  the  principal  section  of  the  octahedron, 
is  a  plane  passing  through  the  origin,  the  center  of  the  face,  and  the 
centers  of  both  the  simplexes  P  and  N  given  respectively  by 


x  + 
1 


+  x  =  a 


n  +1 


=  x 


2n 


=  0 


(4.27) 


and 


x  =  •••  =  x  s  0  ,  x  _  +  •  • 0  +  x0  =  a  (4.28) 

1  n  n+1  2n 

0<a<l 

In  other  words,  is  the  direct  sum  of  the  orthogonal  lines  Q  ,  Q1 

given  respectively  by 

X1  =  000  =  X2n  (4.29) 


and 


x,  = 


=  X 


n+1 


=  x 


2n 


x,  = 


n+1 


(4.30) 


^2n  2  therefore  is  a  hyperplane  perpendicular  to  — it  corresponds 
to  the  set  of  hyperplanes  parallel  to  Hq,  where  all  the  g-elements  are  equal. 
The  two  components  of  in  Q  and  Q1  can  be  thought  of  as  determining 
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respectively  the  distance  from  the  origin  to  the  face  to  be  considered  and 
the  distance  from  the  center  of  the  face  to  the  hyperplane  parallel  to  Hp 
containing  x 

Define  f  and  £  to  be  the  components  of  m  in  Q  and  Q1  respec¬ 
tively.  Note  that 


(4.31) 


is  the  vector  normal  to  H 
From  (4.29)  and  (4.31) 


and  is  not,  in  general,  normal  to 


H 


0 


g  =  0 


(4.32) 


From  (4.32)  and  (4.26) 


x  =  g-(f+£+d)  =  g-  £  +  g 


(4.33) 


and  if,  in  addition,  g  =  gQ 


noting  (4.30), 


a 


(4. 34) 


From  (4.30),  a  is  the  difference  between  the  average  of  the  xt  and 
the  average  of  the  ;  and  the  output  of  a  neuromime  is  never  less 
than  0;  so  that  when  a  <  0  ,  we  have  R  =  0  . 

This  implies  that  the  significant  parameter  in  determining  the  initial 
strength  of  an  input  is  the  difference  between  the  average  of  the  I+  and 
the  average  of  the  I  .  In  tables  4.1  and  4.2,  we  see  that  following  (4.34), 

x+  =  .  5 
x-  =  .4 

,  (4.35) 

a  =  .1 

R(0 )  =  a  =  .1 
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The  terminal  output  for  positive  adaptation  is  given  by  the  difference 
between  the  average  of  the  P  largest  in  I+  and  p  smallest  in  I 
In  Table  4.1,  this  average  is  .4,  so  that 


R 


final 


.4 


(4.36) 


and  R  will  go  no  higher. 

For  negative  adaptation,  the  terminal  state  will  be  reached  only  if 
the  average  of  the  p  smallest  in  I+  is  greater  than  or  equal  to  the 
average  of  the  p  largest  in  I  ,  since  otherwise  R  will  go  to  zero 
before  the  terminal  state  is  reached.  In  the  example  of  Table  4.2,  this 
difference  is  -.2.  As  can  be  seen,  the  g-sets  are  nowhere  near  a  terminal 
state,  yet  the  output  is  very  near  zero.  The  adaptation  equations  developed 
in  Section  5  can  be  solved  to  get  the  g-set  for  which  R  becomes  zero.  These 
are  solved  in  Appendix  II. 

It  remains  to  consider  the  effect  of  the  bias  and  the  S  inputs  on 
this  geometrical  model.  These  additional  parameters  can  be  regarded  from 
Equation  (2.1)  as  constituting  an  additive  constant  to  the  measurement 
Equation  (4.1).  Equation  (2.1)  becomes 


c  =  B  +  S 

R  =  x  •  g  +  c  .  (4.37) 

->  -*■ 

Expressing  x  •  g  as  |x|  |g|cos  0  , 

setting  R  =  0  ,  and 

solving  (4.37)  for  0  ,  we  have 


cos  0 


-c 


(4.38) 


Angle  0  is  the  angle  between  the  x  vector  and  the  fixed  g  vector 
which  determines  the  locus  of  points  for  which  R  =  0  .  Any  x  whose  angle 
with  g  is  greater  than  0  will  not  produce  an  output.  Note  that  the  value 

-f 

of  this  critical  angle  depends  on  the  magnitude  of  x.  Figure  4.4  shows  a 
sketch  of  0  versus  |x|  . 
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90° 


Figure  4.4 

Sketch  of  Critical  Angle 


6  =  cos 


-1 


(  |l|  |J|  ) 


|s|  |*| 

for  c>  0,  c  =  0 ,  c<0 


Justification: 


d6 

dx 


1  - 


|x|2  |I|2 


-*■  0  as  I  x  I  -*■ 


For  c  >  0, 

0  =  180° 

when 

1*1 

c 

1*1 

For  c  <  0 

O 

o 

II 

CD 

when 

1*1 

_  c 

lgl 
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This 


As  might  be  expected,  for  c  =  0,  0=0  for  any  value  of  |x|  . 

corresponds  to  the  hyperplane  H  of  (4.11)  which  is  the  locus  of  x  or  - 
**>*  -*■ 

thogonal  to  g  .  When  c  <  0,  no  x  will  produce  an  output  if  its  magnitude 

i  ->  *  -*• 

is  less  than  c/|g|,  since  no  matter  how  close  to  g  it  is ,  the  output  can- 
not  overcome  the  negative  bias.  Conversely,  when  c  >  0,  every  x  whose 
magnitude  is  less  than  c/|g|  will  produce  an  output,  since  no  matter  how 
far  from  g  it  is ,  it  cannot  overcome  the  positive  bias.  In  both  cases, 
as  |x|  increases,  the  effect  of  the  bias  is  reduced  in  that  as  0  approaches 
90°  the  critical  value  approaches  that  for  c  =  0. 

The  effect  of  the  bias  is  to  change  the  volume  of  the  response  area. 

When  c  <  0,  the  volume  is  decreased,  and  when  c  >  0,  it  is  increased. 

It  is  somewhat  difficult  to  visualize  this  phenomenon  in  the  octahedron 
model,  but  the  hatched  areas  under  the  curves  of  Figure  4.4  give  some 
idea  of  how  it  works.  For  c  >  0,  the  response  area  is  greater  than  for 
c  =  0,  and  for  c  >  0,  the  response  area  is  less. 

When  the  bias  is  large,  the  threshold  hyperplane  can  be  rotated  by  the 
adaptation  process  so  that  when  the  bias  is  again  reduced  the  components  of 
the  level  will  give  maximal  response  to  new  inputs.  This  property  can  be 
used  to  cause  neuromimes  near  terminal  states  to  "forget"  their  conditioning 
so  that  they  can  be  used  for  new  tasks. 

As  described  in  Section  II,  the  S  inputs  are  interaction  inputs 
between  different  neuromimes  on  the  same  level.  There  is  an  adaption  process 
for  these  inputs  as  well  as  for  the  I  inputs,  which  was  described  in 
Section  II  when  there  is  only  one  neighbor  connected  to  both  an  S+  and  an 
S  input.  There  can  be  a  large  number  of  such  connections,  and  the  S 
g-sets  behave  much  like  the  I  g-sets.  It  makes  more  sense,  however,  to 
consider  the  S  inputs  in  a  model  as  in  Section  III  and  take  the  effect  on 
the  geometrical  model  as  one  of  additional  bias. 

The  adaptation  process  on  a  large  number  of  S  inputs  will  cause  the 
degree  of  interaction  to  alter  differentially  among  them,  so  that  the 
influence  of  those  neighbors  with  high  outputs  will  be  altered  differently 
from  that  of  those  with  small  outputs.  As  with  the  sets,  the  controlling 

factors  in  the  differentiation  are  the  dispersion  among  the  S  inputs  and  the 
maximum  allowable  value  for  an  individual  g^. 
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The  equations  involved  are : 

-* 

are  the  individual  components  of  S 


xt5' 

i 


Dg’-  =  S+’“  -  S+>" 


dt*  are  the  individual  components  of  D*’ 
1  b 


G**“  =  G**~  +  F„ 


R 


Ds’~ 


Si 


are  the  individual  components  of  G( 


p  is  the  number  of  g.  and  g.  allowed  to  reach  a  nonzero  terminal  j^tate. 

o  XX 

When  F  is  positive,  those  g.  corresponding  to  positive  d.  will  be 
b  i  +  1 

increased  and  those  corresponding  to  negative  d^  will  be  decreased,  while 

those  g^  corresponding  to  positive  d^  will  be  decreased  and  those  g^ 

corresponding  to  negative  will  be  increased*  The  terminal  state  is 

that  the  gt  corresponding  to  the  p  highest  dt  and  the  g.  correspond- 

ing  to  the  pQ  lowest  d.  will  be  equal  to  1/p  ,  while  all  other  g.* 

"*  o  1 


will  be  equal  to  zero* 


l 

When 


is  negative,  the  reverse  occurs. 


In  this  manner,  the  degree  of  interaction  among  a  component  and  its 
neighbors  can  be  limited  to  the  most  significant  (Fc  positive)  or  to  the 
least  significant  (Fg  negative)  for  a  particular  class  of  stimuli. 

It  should  be  noted  that  the  outputs  of  a  level  form  the  same  sort  of 
signal  space  as  the  inputs,  so  that  the  outputs  of  one  level  can  be  used  as 
inputs  to  other  levels* 
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Section  V 


ADAPTATION  IN  A  SINGLE  NEUROMIME 


A  question  of  some  importance  in  the  operation  of  a  neuromime  is  the 
way  in  which  the  adaption  process  takes  place  and,  in  particular,  how  the 
various  parameters  affect  the  adaptation.  We  will  again  deal  with  the 
problem  with  Mg  =  0,  but  will  allow  a  non-zero  bias  B.  Later  in  the 

section,  the  general  case  will  be  considered.  We  will  assume  throughout 
piecewise  constant  inputs. 

The  adaption  equations,  given  in  Equations  (2.2)  in  incremental  form  can 
be  expressed  in  differential  form,  using  the  notation  of  Section  III. 


S  G  =  FRD+ 
at 

A  -*■- 

—  G  =  -FRD 


(5.1) 


where,  using  the  notation  of  (4.9) 


n  2n 

R  =  M  I  £  g  x.  -  l  g.x 
\ i=l  i=n+l 


+  B 


The  Equations  (5.1)  may  be  expressed  as  the  system 


dt  gl  -  ™I<dlxlgl  *  *  dlVn>  *  ™I(-dlxlel  '  ' 


••  -dtx'g")  +  FBdt 
1  nton  1 


3f  %  =  rai<dMel  +  •"  *  dnxn4)  +  FMI(-dMgI  - 


-d+x"g~)  +  FBd+ 
n  nbn  n 


(5.2) 


d  - 
dt  gl 


F"i(dIxig!  t  •••  +  ™i(-dixigi  -  -dixX)  *  FBdi 


d  - 
dt  8n 


FHI(dn  x!gl  * 


+d~x+g+)  +  FM_(-d  xTg.  -  •••  -d  x  g  )  +  FBd 
n  n°n  I  n  ltol  n  nton  n 
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Adopting  the  definitions, 


as  in  Equation  (4.10) 


t 

where  v  is  the  transpose  of  v  (i.e.,  the  row  matrix  representation) 
and  the  operation  (£)  represents  matrix  multiplication. 

In  Appendix  III  the  solution  to  the  system  (5.3)  is  given  by 


Xt  .  FB  /c  .  v 

g  =  cue  t  k - r-  u  (5.4) 

* 

where  c  is  an  arbitrary  constant,  and  k  is  a  vector  of  2n  arbitrary 
constants  the  last  element  of  which  is  determined,  and 


\  =  u  •  v  =  FM, 


=  FM, 


=  FM, 


=  FM, 


n 

I 

i=l 

,+  +  r 

d.x.  +  )  d. 

l  l  . l 

1=1 

X  . 

1 

n 

l 

i=l 

dt  (dt  +x+)  + 

n 

I  < 

i=l 

iT(d- 

+  *-) 

"  n 

l 

i  =  l 

dt2  +  i  d?2 

1  i=i  1 

n 

+  x+  l 
i=l 

+  -  2  -1 
d.  +  x"  T  d. 

1  i=l 

~  n 

l 

i=l 

,  n 

d.  +  I  d. 2 
1  i=l  1 

(5.5) 

since  from  Equation  (2.5), 
n  n 


l  dt  =  l  d~  =  0 
.L „  1  . 1 


i=l 


i=l 
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It  will  be  convenient  to  simplify  the  notation,  removing  the 
superscript  +  and  -  ,  and  expressing  the  g^  as  functions  of  the 
time  t  ,  as  follows : 


=  g. 


gn+i  (t)  =  gI 


d.  =  d+ 

1  1 

d  .  =  d7 

n+i  1 


so  that  (5,5)  is  expressed  as 


for  i  =  1  , . . . ,  n 


2n 

A  =  FMj  l  d?  (5.6) 


The  summation  in  (5.6)  is  important  in  the  following,  and  will  be  referred  to 
by  the  symbol 


2n 

6  s  l  d? 

.  ,  i 


i=l 


so  that 


X  =  FM  6 


The' system  (5.4)  becomes 


...  ,  FMT6t  A  ,  B  . 

gl(t)  =  od^  I  *  di 


j  FMT6t  ,  B 

g(t)  =  cde  I  +k  -  -j— r-  d 
°n  n  n  M^.6  n 


g  ,(t)  ,  FMT6t  ,  B  J 

Bn+1  =  -cd  ,,e  I  -  k  +  7^-7-  d 

n+1  n+1  U_6  n+1 


o  o 

•  o 


...  ,  F«T6t  .  .  B 

g2n(t)  -Cd2ne  1  "  k2n  +  d2n 


(5.7) 


(5.8) 


(5.9) 
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9***9 


System  (5.9)  must  be  solved  for  the  constants  c  and 
by  evaluating  for  t  =  0  and  the  initial  values  of  the  g^(t)  , 


B 


gl(0)  =  cdl  +  kl  -  i^6  dl 


gn(0)  =  cdn  t  kn  -  g!L  dn 

W0)  =  -cdn+l  -  k„+l  +  A  d„tl 


s2n<0)  =  -cd2n  '  k2n  *  d2„ 


with  the  addition  of  the  constraint  on  k^n  from  Appendix  IIIj 


k 

n 


(5.10) 


FM  x  k  +  •••  +  FMtx  k  +  FMx  k  +  •••  +  FM,x0  ,k0  . 

Ill  Inn  I  n+1  n+1  I  2n-l  2n-l 

c2n  "  -FMtx„ 

I  2n 

x,k  +  •••  +  x  k  +  x  ,k  ,  +  •••  +  x^.  .k.  , 

11  n  n  n+1  n+1  2n-l  2n-l 


(5.11) 


2n 


Solving  (5.10),  we  obtain 


c 


El(0)  -  kl  +  iifs  dl 


(5.12) 


k. 

1 


Ei(0)  -  cdi  ^  HJJ  di 


k. 

1 


-gA(0) 


cd. 

X 


x  1  , . . . ,  n 
i  =  n+1  , .  . . ,  2n 


can  be  obtained  by  equating  the  expressions  for  k^n  in  (5.11)  and 
(5.12),  substituting  the  expression  (5.12)  for  k^  in  (5.11)  with 
i  =  2  2n-l;  and  substituting  for  c  the  expression  in  (5.12). 
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x.k_  n  x. 

1 1  *  i  — 

.  ^  -x* 


-x 


2n  i=2  2n 


2n-l  x. 

♦  I  — 

U  -V 


g^o)  - 


gi(0>  -  ki +  Hp- d! 


i=n+l  2n 


-gi(0)  - 


|g-L ( 0 )  -k±  +  M^6  dj 


di  +  M^6  dl 


d.  +  jp-  d. 
l  Mj.6  l 


=  -s2„(0)  - 


gl(°)  -  k1  +  dx 


d2n  +  Mj.6  d2n 


(5.13) 


Multiplying  both  sides  of  (5.13)  by  -<^x2n  expanding. 


n  r 


xlkldl  +  ^ 

i=2 


xigi(0)dl  '  xigl(0)di  +  xikldi  "  M^6  dlxidi  +  M^6  dlxidi 


2n-l 


♦  l 

i=n+l 


B  B 

-xig.(0)d1  -  xig1(0)di  +  x^k^  -  g-g*  d1xidi  +  g-g-  ^x^ 


(5.14) 


=  g2n(0)x2ndl  +  X2nSl(0)d2n  "  X2nkld2n  +  Mjfi  X2nd2ndl  “  Mj6  X2nd2ndl 

g 

Gathering  terms  in  (5.14)  noting  that  all  terms  containing  — drop  out, 
we  obtain 


2n-l 


lldl  +  .1  xidi  +  .  2  xidi  +  x2nd2n 


i=2  i=n+l 


dlx2ng2n(0)  +  gl(0>  x2nd2n  '  j2  dlxigi(0)  +  j2  gl<0)  xidi  (5-15) 


2n-l  2n-l 

♦  .  I  aixi*i(0)  *  .  I  ,  gl(0)  xidi 

i=n+l  i=n+l 
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The  left  side  of  (5.15)  can  be  simplified. 


n  _  * 

c  d  +  7  x.d.  +  T  x.d.  +  x0  dn 

1  1  ii  ii  2n  2n 


2n-l 

l 

i=n+l 


2n 


2n 


=  ki  I  Vi  =  kl  l  di  =  ki6  • 
i=l  i-1  1 

(5.16) 

The  right  side  of  (5.15)  can  also  be  simplified,  by  adding  and  subtracting 
the  term  g^  (O)x^d^  and  rearranging, 

gl(0)  xldl  +  sl(0)  l  xidi  +  gl(0)  .  1  ,  xidi  +  gl(0)  x2nd2n 

1=2  i=n+l 

n  2n-l 

■dl¥l(0)  _dl  l  Xigi(0)  +  dl  .  I  .  xigi(0)  +  dlx2ng2n  (0) 
i=2  i=n+l 


2n 


*  gil0)  l  xidi  -  di 

i=l 


n 


2n 


I  xigi(0)  -  l  xigi(0) 
i=l  i=n+l 


Defining,  according  to  (5.1), 
R(0)  =  M 


n  2n 

I  Vi<0)  -  .  J  xigi(0) 

i=l  i=n+l 


+B 


and  substituting  (5.16)  and  (5.17),  equation  (5.15)  becomes 
R(0)  -  B' 


kx6  =  g1(0)6  -  d± 


or 


ki  =  gi(0)  '  T 


M, 


R(0)  -  B 


M, 


Substituting  (5.20)  into  the  expression  in  (5.12)  for  c. 


c  = 


d! 

cr  f  n  ^  f  n  ^  l.  i  m  -r- 

o 

1 

,  B 

t  — 

MT 

M  6  dl 

L  I  J 

I 

(5.17) 


(5.18) 


(5.19) 


(5.20) 


(5.21) 


or,  simplifying, 

c  -  ijij  cxo)  -»♦»]■ 


(5.22) 
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Substituting  the  expression  for  the  from  (5.12)  into  the 

system  (5.9)  we  obtain  the 'solution  to  (5.1), 


gx(t) 


FMT  6t  ,  _  , 

cd^e  I  +  g1(0) 


-  cd. 


s i  \  ,  FM_6t  f 

g  (t)  =  cd  e  I  +  g  (0)  -  cd 

fen  n  Bn  n 


■gn+l(t)  *  c<WeF"l6t  -  en+l  <0)  *dcdntl 


(5.23) 


-«2n(t)  *  cd2ne™lSt  *  g2n<0)  *  cd2n 


In  a  vector  notation,  using  the  definitions  in  (5.1),  (5.23)  is 


G+(t )  =  cPemiH  ♦  G+(0)  -  CD* 
-G-(t)  ,  cS-e™!61  -  G~(0)  -  c 5- 


(5.24) 


and  in  the  form  of  the  system  (5.3);  (5.23)  is,  substituting  (5.22)  for  c. 


R(0)  +  FMT6t  .  R(0)  -► 

g(t)  =  rr-r-  ue  I  +  g(0)  -  U 


M  6 


K  6 


(5.25) 


or, 


;<t>=  ;  <e™i6t .  i,  tg(o) 


(5.26) 


Note  that  there  are  two  basic  controlling  factors  in  (5.26),  an 
exponential  term  and  a  constant  multiplier  term.  The  exponential  term 


X  =  F  M  6 


(5.27) 


is  controlled  by  the  variance  of  the  input  and  both  the  input  level  weight 
and  the  learning  factor.  The  constant  term 
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R(0)  _  V  '  S<0)  +  B 

MjS  "  M  6 

_  x  »  g(0)  B 

6  +  Mj6 


(5.28) 


is  dependent  mostly  on  the  bias  term,  since  x  •  gQ  can  be  of  the  same  order 
of  magnitude  as  6. 

We  are  now  in  a  better  position  to  attack  the  general  problem  with 
IAC  i  0.  There  are  several  possible  ways. 

One  important  consideration  is  that  we  are  interested  in  interconnec¬ 
tions  of  the  type  used  in  Section  II  so  that  we  want  to  determine  the 
adaptation  of  the  m-element  vector 


This  can  be  determined  by  differentiating  (5.29) 


(5.29) 


±  t  _  JL  g+  JLr- 

dt  K  "  dt  GS  "  dt  GS 

(5.30) 

=  Fg  R(S+  -  S+)  -  [-Fg  R(S-  -  S")] 


If 

-y  ~y 

s =  s"  =  s 

.  th 

and  is  the  i  element  of  S  . 

Equation  (5.30)  reduces  to 


K(t)  =  2FgR(S  -  5)  (5.31) 

The  simplest  way  of  solving  (5.31)  is  to  consider  the  solution  of  the 
system  for  =  0,  then  solve  (5.31)  with  R  held  constant  and  the  feedback 
delay  of  the  net  large  enough  so  that  the  value  of  S  remains  constant.  The 
solution  is  obtained  immediately.  Define 

E  =  S  -  i  .  (5.32) 


Then 


K(t )  =  2F  REt 


(5.33) 
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A  more  realistic  solution  is  to  assume  that  R  varies  but  that  S 
remains  constant.  In  this  case,  following  the  same  arguments  as  in  (5.1),  the 
solution  is 


K(t)  =  E(eXt  -  1)  +  K(0) 

KS62 


(5.34) 


where 


and 


X  =  2FsMs62 
m 

60  =  e? 

2  >,i 

1=1 


e^  is  the  it31  component  of  E 


(5.35) 

(5.36) 


The  interactions  between  the  two  adaptations  can  be  examined  by 
solving  the  two  systems  simultaneously.  The  complete  system,  (2.2),  using 
the  notation  of  this  section,  is  given  by 


a?i(t)  *  Vi  Mi 


n  2n 

l  x--,gi(t)  ‘  £  x-jS-j(t) 

j=l  J  J  j=n+l  J  J 


m 


+  B  t  M_  J  s.k.(t) 
s  j=i 3  3  j 


for  i  =  1, . . . ,n 


f  n  2n 

l  x-g^(t)  -  l  x  g  (t) 


■~r  g.(t)  =  FTd.  I M  i 

dt  &i  I  i  I  I  ,L.  .  **  3W3 

L  [_3  =  1  ]=n+l 


m 


B  +  M  1  s.k  (t)l 

Si=l  3  3  J 


for  i  =  n  +  1 , . . .  ,2n 


dt  i 


2n 


^rk.(t)  =  2F  e./M  l  x.g.  (t)  -  £  x.g.(t) 

L  y=1  J  J  j=n+l  J  J 


m 


+  B  +  M  l  s  k  (t) 

S  3=1  3  3  J 


(5.37) 


for  i  =  1,. . .  ,m. 

Following  (5.3)  system  (5.37)  can  be  expressed  as 


r^-y(t)  =  u©vt0y(t)  +  bu 


-y  -y 


(5.38) 
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where 


and 


b  =  B 


so  that  the  solution  is  given  by  Appendix  III  as 


where 


Letting 
5,  = 


-*■  At 

-+ 

B 

1  =  cue 

+  z  - 

’  A  U 

u  •  V 

2n 

m 

ftmt  y 

d?  + 

2F.M- 

l  e? 

1 1  .l. 

i 

S  S 

. L .  l 

1=1 

i=l 

2n 

m 

l  d?  , 

62  = 

l  e 

2 

;  > 

i=l  1 

i=l 

i 

FTMT6,  + 

2F0M0 

III 

S  S 

i  2 

argument 

from 

(5.9) 

through 

R(0) 

A 

^IX1 


Mtx 
I  n 


MTx  . 
I  n+1 


MIx2n 

Mssi 


Mcs 
S  m 


(5.39) 


(5.40) 


(5.41) 


(5.4;:) 
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where 


R(0)  =  y  •  x  +  B 
and  the  solution  is 


;<t,  .  i,  t  *<„>  . 

X 


(5.43) 


(5.44) 


The  interaction  between  the  two  systems  can  be  seen  by  examining  in 
detail  the  solutions  for  g^(t)  and  k^(t). 


F].R(0)d1(e/Xt  -1) 

gl(t)  =  +  2FsMg62  +  gl(0) 


(5.45) 


2FsR(0)ei(eXt  -  1) 

kl(t)  ”  FtM_6,  +  2F„M„6,.  +  ki(0)  * 


(5.46) 


III 


S  S  2 


Again,  we  see  that  the  variance  and  the  weighting  parameters  control  the 
exponential  factor,  and  the  bias  is  the  main  control  in  the  multiplier 
factor,  with  the  additional  control,  the  ratio  of  the  F^  and  Fc  weights. 

We  are  now  prepared  to  give  some  answer  to  the  question  of  the  neuromime 'c 
"memory".  If  a  stimulus  is  presented  and  the  g-sets  altered  according  to 
that  stimulus  for  a  time  t,  then  another  stimulus  is  presented  and  the  g-sets 
altered  again,  how  much  of  the  original  adaptation  remains? 

The  output  of  the  neuromine  at  time  t  after  the  beginning  of  the 
adaptation  is,  from  (5.1) 


R(t )  =  MJg(t )  •  x  +  B 
substituting  (5.26)  for  g(t), 


(5.47) 


R(t )  =  M. 


,  Xt  ,  N  R(0)  -*■  ’+/n\ 

(e  “  1}  M^6“  u  +  g(0) 


x  +  B 


=  (eXt  -  1)  u  •  x  +  M_g(0)  •  x  +  B 

6  1 


=  R(0)e 


Xt 


(5.40) 
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We  will  consider  an  input  followed  by  an  input  x^ 

then  examine  the  effect  of  a  repetition  of  x^  .  Parameters 
(5,26)  will  be  subscripted  according  to  the  input  concerned. 
(5.48) 

At 

Rx(t)  =  R^COe 

R-j(o)  X  t 

gl^  =  (e  "  1)dl  +  gl(0) 

R2(0)  =  M^t)  •  x2  +  B 
^2t 

R2(t)  =  R2(0)e 

R9(0)  X  t  .+  -> 

g2(t)  5  <e  '  1)d2  +  sl(t)  • 

Applying  the  input  x^  again,  we  have 

R3(0)  =  Mjg^t)  •  X1  +  B  . 

The  variable  of  main  interest  is  the  loss  of  adaptation 

L  =  R3(0)  -  Rx(t)  . 

Substituting  the  expressions  in  (5.50)  and  (5.49),  into 
L  =  R3(0)  -  R1(t) 

=  Mjg2(t)  •  x1  +  B  -  R1(t)  . 


Substituting  for 


g2(t). 
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,  and  will 
from  (5.48)  and 
We  have,  from 


(5.49) 


(5.50) 

(5.51) 
( 5 . 51) ,  we  have 


L  =  M, 


V°>  V  »  . 

(e  -  l)u2  1-  6l(t) 


X1  +  B  ”  Rl^t^ 


X2t  uO  *  X1 

=  (MJg1(t)  •  x2  +  B)(e  -  1)  — g - -  +  R^(t)  -  R1(t) 

Substituting  for  g^t),  noting  that  R  (t)  drops  out,  and  expanding. 


L  = 


X  t  u 

Rj^tOXe  1  -D—tHjg^O) 


1  V  U2  - 

x  +  B  (e  -  1)  j-  •  x 

2  J  62  1 


V  “l  •  *o  +  + 

R1(0)(e  -  1)  -— 6----  +  Mjg1(0)  •  x2  +  B 


(e  -  1  )t--  •  k 
S2  1 


From  the  formulation  in  Equation  (4.26),  we  know  that 


U1  ‘  *2  *  U1  ‘  (u2  +  m2)  *  “l  ’  “2  ’ 


-+  ^  ~b  + 

u2  °  Xx  =  u2  •  (ux  +  m1)  =  u2  •  uL 


Defining 


(5.52) 


(5.53) 


Mlgl(0)  •  x2  +  B  =  R'(0)  , 


(5.54) 


and 


X.t 


X„t 


=  (e  1  -  1)  ,  c2  =  (e  2  -  1) 

we  have,  substituting  (5.53),  (5.54)  and  (5.55)  into  (5.52), 


(5.55) 


L  = 


u  •  u 

Rl(0)cl  1  6  ""  +  R,(0) 


U2  *  U1 


'2  6, 


(u..  •  u.)z  uT  •  u 

■  V0)V2  - *  c2r,(0>  —q~ 


(5-56) 
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The  dot  product  in  (5.53)  can  be  expressed  in  terms  of  the  lengths  of 
the  vectors  and  the  cosine  of  the  angle  between  them  [5]  as 


u. 


(5.57) 


and  since  6  =  |u|,  •  Ug  =  /6^  cos0  , 

we  have,  from  (5.56)  if  R(0)  =  r^  and  R'(0)  =  r^  , 


L 


rlClC2COS 


0  + 


r2C2 


(5.58) 


The  adaptation  loss  for  x^  after  having  applied  depends,  therefore 

in  great  part  on  the  angle  between  the  two  deviation  vectors ,  which  is  reason¬ 
able  since  the  adaptation  of  g  is  controlled  by  the  deviation  vector.  Some 

insight  into  the  distribution  of  L  may  be  gained  by  looking  at  the  distri- 

2 

bution  of  cose  and  cos  0.  If  we  assume  that  the  lengths  of  the  deviation 
vectors  are  equal,  we  can  imagine  d.  as  the  north  pole  of  a  2n-dimensional 
hypersphere,  and  examine  the  angle  0  between  d^  and  d^  .  An  element  of 
surface  content  of  this  hypersphere  is  derived  in  Appendix  IV  as 


dS  =  (pdOMpsinQd^MpsinOsin^dt^)  (psinOsin^  •••  sin^^gdjj)^^) 

=  p  sm  0sm  (j>1  •••  sin$2  gdfd^  •••  d<t>2n-2  (5.59) 

where  p  is  the  radius  of  the  hypersphere,  and  0,  •  •  »4>2n_g  are 

successive  hyperconical  angles,  corresponding  to  the  latitude  on  a 
3-dimensional  sphere,  ranging  from  0  to  tr,  while  < p ^  2  planar 

angle,  allowed  to  range  from  0  to  tt  . 

The  probability  of  0  occurring  between  0  and  6^  is  the  ratio  of 
the  surface  content  of  a  segment  of  the  hypersphere  cut  out  by 

O<0<0n  ,  0 «p.<tr  i  =  1 , . . .  ,2n-3 ,  0«j^  <2tt 

w  i  2n-2 

which  is  proportional  to,  from  (5.59),  (with  p  =  1) 


P(e<90)  =  [““sin^edej’sin2”-3^*!  —  jjsln^jd ♦2n.3(3’  d+2„_; 

=  K|®°sii 


Jsin2n'20d0 
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(5-60) 


2n  2 

Since  as  n  increases,  sin  0  decreases  for  9  t  tt/2  ,  the 

probability  that  0  deviates  from  tt/2  decreases  as  n  increases,  and 
the  averages  of  cos  0  and  cos20  approach  0.  This  lends  justification 
to  the  belief  that,  on  the  average,  the  adaptation  loss  L  will  be  small, 
so  that  a  neuromime  will  tend  to  "remember"  its  previous  training. 
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Section  VI 


STRUCTURE  AND  OPERATION  OF  NEUROMIME  NETS 


With  the  development  in  previous  sections,  we  can  now  discuss  in  a 
unified  manner  the  structure  and  behavior  of  a  neuromime  net,  including 
some  ideas  on  the  use  of  the  net  simulators  currently  under  development. 

Recall  that  nets  are  organized  as  a  series  of  computational  areas  or 
logical  levels,  each  area  containing  a  number  of  neuromime  components.  The 
S  inputs  of  the  components  within  an  area  come  from  the  outputs  of  other 
components  in  the  same  area,  while  the  I  inputs  come  from  outside  tne 
area,  either  from  the  pri  mary  sensory  inputs  or  from  the  outputs  of  com¬ 
ponents  in  other  computational  areas. 

A  neuromime  net  computation  area  is  composed  of  a  set  of  computing 
elements 


f | i-1, • . . ,m} 


together  with  their  connections  to  the  outside  world.  In  addition,  each 
component  has  a  number  of  state  inputs  which  are  outputs  from  other  com¬ 
ponents  in  the  same  area. 

If  r^  is  the  output  from  component  ,  the  effect  of  tne  interaction 
is  given  by  an  iterative  process,  whicn  proceeds  until  convergence  is  od- 
tained.  At  time  i+1,  r^  is  computed  from  the  outputs  of  the  other  compo¬ 
nents  at  time  i: 


r1(i+l)  =  k12r2(i)  + 


r2(i+l)  =  )<2iri(  i )  +  k23r3(i)  + 


0  O  0 


(6.1) 


o 


o 


o 


r  (i+1)  =  k  . r. (i )  + 
m  ml  1 


ients.  The  indices  can  be  interpreted 

on  the  out d ut  of  C. 

i 
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The  system  (6.1)  is  a  linear  system  of  equations,  and  can  be  re-expressed 


as 


P (i  +  1)  =  Mp(i)  +  p(0) 


where 


(6.2) 


r,  1 

12  13 

lm 

1 

k,.0  k  „ 
21  23 

•  •  •  k 

2m 

o 

• 

and  M  = 

•  • 

•  • 

• 

• 

• 

e  • 

*  j 

*m) 

k  k 

\  ml  m. 

m-l  °l 

and  p(0)  is  the  set  of  outputs  before  the  computation  of  the  interaction 
has  begun. 

Since,  at  the  beginning  of  the  iteration. 


p(l)  =  Mp(0)  +  p(0) 
(6.2)  is  by  induction 


p (i )  =  MXp(0)  +  M1-1p(0)  +  •••  +  Mp(0)  +  p(0) 


(6.3) 


=  l  p(0) 
j=0 

where  M°  is  defined  to  be  the  identity  matrix. 


(6.4) 


The  example  given  in  Figure  6.1  may  help  to  clarify  this  process.  The 
example  illustrates  the  simple  situation  where  m  =  2.  On  the  left  are  the 
symbolic  interaction  matrices  for  i  =  0,  1,  2,  3,  4;  and  on  the  right  are 
the  corresponding  matrices  with  k12  =  -.1  and  k21  =  .2.  At  the  bottom 

is  given  the  sum  of  the  five  matrices. 
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1 


M 

symbolic 


0 


\k21k12 


/k12k2l  0 


k21k12  / 


k12k2l 


M 

numeric 


11 

0  ' 

[1 

O' 

0 

\0 

1° 

1/ 

-.02 

\° 

0 


-.004 


/  0 

kl  2^ 

1 

/o 

-.l\ 

\k21 

°  1 

i*2 

0  / 

-.02 


+  .002 


^ ( k i 2k2 1 ) ^  ' 

i . 

1 +.0004 

0  ' 

\0  (k2lk12>2,/ 

10 

+.0004/ 

4 

I 

M1  =  | 

/ .9804 

-.098\ 

i=C 

) 

1.196 

.9804 

Figure  3.1  Interaction  Matrices  of  Dimension  2. 
ki2  =  -.1  $  k2i  =  .2 
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If 


4 

p(4)  =  l  M1  p (0)  (6.5) 

i=0 


then 


r  (4)  =  .9804  r,  (0)  -  .098  r  (0) 
1  1  2 


(6.6) 


r^(4)  =  .196  r^  (0)  +  .9804  r^O)  (6.6) 

As  might  be  expected,  the  output  from  Cj  is  decreased  and  the  output  from 
C2  is  usually  increased,  unless  r^O)  is  very  small  compared  with  ^(0), 
in  which  case  the  second  order  coupling  of  C2  with  itself,  given  by 

(M2)22  =  klk2  =  '*02  (6.7) 

becomes  dominant. 

A  more  complex  example  illustrating  so rnnH  order  coupling  more  clearly, 

is  given  in  Figure  6.2,  showing  M2  for  m  =  4.  Each  eni *  v  £n  ^he  matrix 

M2,  (M2).  . ,  is  interpreted  as  the  second  order  coupling  of  C.  w q 

13  J  i  * 

Specifically, 


(M2) 


11 


k12k21 


k  1  3k  3 1 


kl4k41 


(6.8) 


is  the  coupling  of  C  on  itself  through  C ,  plus  the  coupling  of  on 

itself  through  ,  plus  the  coupling  of  on  itself  through  . 

Similarly, 


(M2) 


34 


*  k31kU 


k  k 
32  24 


(6.9) 


is  the  coupling  of  C4  on  C3  through  plus  the  coupling  of  C4  on 

C3  through  . 

A  problem  of  importance  is  whether  the  system  will  eventually  produce  a 
stable  output,  which  is  the  same  as  determining  whether 


i 

lim  l  M3  =  M* 
i-*»  j  =  o 
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II 

CM 

s; 
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Figure  ^2 

Interaction  Matrices  of  Dimension 


exists.  It  has  been  demonstrated  in  Appendix  V  that  the  system  will  be 
stable  if  the  determinant 

|l  -  M|  >0  (6.11) 

One  scheme  for  interconnecting  elements  so  as  to  obtain  the  most 

flexible  coupling  coefficients  is  that  discussed  in  Figure  2.5,  where  r. 

th  f  -  ^ 

is  connected  to  both  the  Z  S  and  S  inputs  of  ,  The  interaction 

given  by 

rjgt  -  TjH  rj(4  -  8?  <6-12) 

and  the  coupling  coefficient  is 

kij  *  Ms(gI  -  g't>  <6-13) 

It  is  now  possible  to  state  in  a  unified  way  the  operations  performed 
by  a  net  of  m  components  each  of  which  has  n  inputs*  We  require  a 
number  of  definitions,  and  will  re-define  some  of  the  symbols  used  in 
previous  sections  to  obtain  a  clearer  statement* 

Let  N  be  the  incidence  matrix  of  the  net,  which  describes  its  inter¬ 
connection  structure* 


"  * 


where  n^  =  1  if  the  output  of  is  connected  to  the  state  input  of 

C.  •  N  is  an  mxm  matrix* 

1 

The  inputs  to  and  parameters  of  each  component  will  be  included  in  a 
series  of  mxn  matrices,  the  i ^  row  of  which  will  correspond  to  the 
associated  vector  of  the  i**1  component*  These  matrices  are  defined  in 
Table  6.1. 
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Table  6.1 


Variable  Matrices  for  Net  Level 


m*n  Matrix 

.th 

i  row 

X 

the 

inputs  to 

C. 

i 

ordered  as  in  (4.19) 

X 

the 

-*■ 

vector  m 

of 

as  in  (4.19) 

G 

the 

g-set  of 

C. 

l 

as  in  (4.19) 

D 

the 

vector  d 

of 

as  in  (4.19) 

Parameters  common  to  the  net  are: 

6  -  vector  each  of  whose  ra  components  is  the  bias 
parameter  B  of  (2.1) 

-  the  parameter  of  (2.1) 

w^  -  the  parameter  of  (2.1) 


There  must  be  a  set  of  deviation  vectors  for  the  interaction  inputs 
at  each  component.  These  can  be  obtained  from  the  output  vector  p  and 
the  incidence  matrix  N  as  follows: 


E.  . 


m 

l  Ni£r£ 

£=1  *  * 


m 

l  Nu 

H=1  * 


(6.14) 
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The  incidence  matrix  acts  to  pick  out  those  outputs  r^  which  are  attached 
to  component  C^,  and  the  sum  across  row  i  of  N  is  the  number  of  r^ 
attached  to  . 

The  result  of  adaptation  may  be  expressed  by  multiplying  each  row  of 
the  deviation  matrices  by  an  "adaptation  coefficient"  determined  as  in 
Section  5.  These  coefficients  will  be  gathered  in  the  vectors  A^  and  A^, 
coefficients  for  the  external  inputs  and  state  inputs,  respectively.  The 
operation  of  multiplying  the  i*“  row  of  a  matrix  A  by  the  i**1  component 
of  a  vector  a  where  A  has  p  rows  and  a  has  p  elements ,  will  be 
denoted  by 

B  =  a®A  (6.15) 

Another  operation  used  is 

a  =  diag(A)  (B.Jfi) 

a  is  the  vector  whose  elements  are  the  diagonal  elements  of  the  matrix  A. 

We  can  now  state  the  sequence  of  operations  of  a  net.  First,  the  out¬ 
puts  are  computed: 


T  =  WjX 
00 

M*  =  l  M1  (6.17) 

i=0 

p  =  M*[ diag(TGt)  +  6] 

Next,  the  matrices  M  and  G  are  altered  according  to  the  adaptation: 

S'  =  AX®D  +  G 
M '  =  w2A?®E  +  M 

Experimental  nets  can  be  constructed  and  tested  using  the  simu¬ 
lators  under  development.  The  general  purpose  digital  si’^ulator 
is  currently  operational,  and  its  structure  is  closely  related  to  the 
theoretical  structure  developed  in  this  report. 
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An  important  property  of  this  simulation  is  that  nets  can  be  constructed 
on  a  statistical  basis,  freeing  the  programmer  from  tediously  specifying  the 
interconnections  among  all  of  the  components  on  an  individual  basis.  In  this 
construction,  a  metrical  structure  is  imposed  on  the  net  so  that  a  distance 
is  defined  from  one  component  to  another  component  on  the  same  or  a  different 
level,  including  a  distance  from  the  primary  sensory  inputs  on  the  first  level. 
A  standard  distance  function  is  available,  but  the  programmer  may  arbitrarily 
define  a  new  one  if  he  wishes. 

The  program  generates  a  net,  making  connections  between  components  on 
a  probabilistic  basis,  the  probability  depending  on  the  distance  in  some 
fashion.  There  are  several  standard  probability  function*  in  the  program, 
and  The  programmer  may  also  define  new  ones. 

Another  feature  of  this  simulator  is  that  it  has  a  criterion  for  the 
activity  level  of  a  computational  area,  and  automatically  adjusts  the  bias 
level  to  neep  the  activity  within  a  certain  pre-established  range.  This 
device  controls  the  relative  importance  of  the  communication  channels 
between  different  areas,  so  that  certain  information  paths  are  given  more 
weight  than  others  on  an  a  priori  basis.  It  also  functions  as  an  overall, 
gain  control. 
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Section  VII 


UNSOLVED  PROBLEMS 

During  the  course  of  this  program*,  a  number  of  interesting  problems 
were  formulated  but  not  solved.  These  problems  motivated  the  development 
of  the  formalism  of  this  report,  and  should  be  amenable  to  the  attack  of 
these  new  tools.  In  addition,  a  number  of  extensions  to  the  formalism 
were  suggested  as  the  work  proceeded,  but  were  not  on  the  main  line  of 
endeavor,  so  are  as  yet  undone.  This  section  is  a  compendium  of  these 
ideas,  and  they  will  form  a  basis  for  further  research. 

Our  knowledge  of  the  theoretical  properties  of  neuromime  nets  must  be 
put  to  an  experimental  test  to  ensure  that  the  model  is  valid,  A  natural 
test  is  to  make  use  of  data  generated  by  the  laboratory* s  elaborate  sound 
generating  and  anditory  system  simulators. 

Since  the  currently  available  neuromime  net  simulation  equipment 
does  not  allow  time  varying  inputs,  we  must  restrict  our  attention  to 
quantities  which  are  either  independent  of  starting  time  or  can  be  synchro¬ 
nized  in  the  sampling  apparatus.  This  restriction  limits  us  to  considering 
only  fixed  frequency  transient  signals  or  very  noisy  or  periodical  high 
frequency  signals,  since  the  former  can  be  generated  with  associated  synchro¬ 
nization  pulse,  and  the  latter  can  be  considered  as  stationary  time  series 
and  has  observable  properties  independent  of  the  starting  time. 

A  reasonable  series  of  experiments  would  therefore  begin  by  attempting 
to  differentiate  simple  transients,  progressing  to  more  complex  transients. 

An  attempt  could  then  be  made  to  make  equivalent  certain  sets  of  complex 
transients  while  differentiating  between  sets.  Later  experimentation  would 
include  high  frequency  continuous  signals. 

An  important  aspect  of  this  series  of  experiments  is  an  investigation 
of  the  coupling  between  the  neuromime  net  and  its  environment,  especially 
from  the  learning  point  of  view.  Veneed  to  know  how  the  sequences  of  stimuli 
should  be  presented  and  how  we  should  reinforce  the  network.  The  work  of  Rosen¬ 
blatt  (1062)  on  the  perceptron  will  be  of  great  assistance  in  this  area.  It 
is  likely  that  neuromime  nets  have  information  processing  capabilities  equal 
to  or  better  than  perceptrons,  so  we  should  be  able  to  apply  many  of  the 
learning  theorems  which  he  has  derived. 
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Additional  work  should  be  done  on  this  model  in  the  areas  of  statistical 
properties  of  interconnection  matrices,  both  internal  and  external.  It  would 
seem  that  concepts  of  information  theory  could  be  employed  to  good  effect, 
especially  if  related  to  the  topological  structure  of  the  net. 

A  formulation  which  might  be  of  use  in  this  problem  is  that  of  channel 
capacity  density.  We  can  define  a  cross  section  of  an  information  channel 
in  such  a  way  as  to  identify  sets  of  points  which  a  signal  propagating 
through  the  channel  will  occupy  simultaneously.  Some  examples  are  given  in 
Figure  7.1.  Part  (a)  shows  a  transmission  line,  where  the  cross  section  is 
a  point.  Part  (b)  shows  transmission  along  a  plane,  the  cross  section  is 
a  line.  Part  (c)  shows  the  transmission  through  a  network,  the  cross  sections 
are  sets  of  points. 

If  we  defined  a  function  giving  channel  capacity  per  unit  of  cross  section, 
we  can  consider  this  a  channel  capacity  density  function.  By  examining  this 
function,  we  should  be  able  to  derive  many  statistical  properties  of  neuromime 
net  information  handling.  The  work  of  Inselberg,  Von  Foerster*  et  al  (1963) 

is  relevant  to  this  area. 

The  control  theoretic  formulation  of  the  interactions  in  a  net  should 
produce  some  more  useful  information.  Appendix  V  discusses  this  area  in 
more  detail.  A  possible  approach  here  would  be  to  combine  the  statistical 
interconnections  with  the  control  formulation,  and  develop  a  statistical 
control  theory. 

Implicit  in  the  discussion  in  Section  III  is  that  a  neuromirae  net  can 
be  considered  as  a  sort  of  flexible  measuring  device  on  classes  of  input 
stimuli,  measuring  the  presence  of  certain  patterns,  and  recalibrating  itself 
as  it  proceeds.  The  output  of  a  net  can  be  considered  as  a  sort  of  activity 
layer,  where  activity  is  localized  in  spots  corresponding  to  particular 
patterns,  as  in  Figure  7.2(a).  If  the  net  can  be  trained  to  respond  to  a 
new  pattern,  while  keeping  its  response  to  the  old  ones,  a  new  output  area 
might  be  formed,  displacing  some  of  the  others,  as  in  Figure  7.2(b).  It 
would  seem  that  this  property  of  Neuromime  net  processing  could  be  used  to 
develop  a  theory  of  the  operations  performed  on  patterned  sets  of  stimuli. 
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Cross  section 


(a) 

Line  transmission 


(b) 

Homogeneous  Plane  Transmission 


Network  Transmission 
Figure  7.1.  Channel  Cross-sections 
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Inputs 


a.  Original  Pattern  Sensitive  Areas 


Inputs 


b.  Addition  of  New  Area 

Figure  7.2.  Neuromime  Nets  as  Variable  Measuring  Devices 
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Properties  of  patterns  that  are  relevant  to  the  above  structure  are 
that  they  require  both  a  knowledge  of  the  outputs  from  a  set  of  sensors 
and  a  knowledge  of  the  location  of  these  sensors  in  the  outside  world. 

One  may  consider  a  sensory  field  with  a  topological  structure  as  a  window 
on  the  world.  The  pattern  measurement  device  has  a  number  of  different 
kinds  of  sensors,  and  these  are  attached  to  the  field,  sometimes  with 
several  different  kinds  attached  to  the  same  area.  A  pattern  is  defined 
with  two  characteristics,  a  particular  configuration  of  outputs  from  a  set 
of  sensors,  and  a  particular  topological  arrangement  of  that  set  of  sensors. 

In  most  cases,  it  is  useful  to  have  the  pattern  meter  measure  the  degree 
of  presence  or  the  probability  of  presence  of  a  pattern  rather  than  make  the 
outright  statement  that  the  pattern  is  or  is  not  present.  For  this  reason, 
the  pattern  meter  must  measure  not  only  how  near  is  the  sensory  output 
configuration  to  a  standard,  but  also  how  near  is  the  topological  arrangement 
of  the  sensors  to  its  standard.  For  example,  a  straight  line  in  a  visual 
field  is  determined  both  by  a  constant  contrast  level  in  a  set  of  sensors, 
and  by  the  nearness  to  a  straight  line  of  the  sensors  exhibiting  the  desired 
configuration  of  outputs.  An  observer  would  say  that  the  stimulus  in  Figure 
7.3(a)  was  a  straight  line,  and  would  say  also  that  the  stimuli  in  (b),  (c), 
and  (d)  were  nearly  straight  lines. 

A  neuromime  component  can,  in  itself,  adjust  the  particular  subfield 
of  the  sensory  field  it  observes  (by  setting  some  of  its  g-weights  to  zero), 
and  its  output  gives  a  measure  of  nearness  of  the  input  stimulus  to  a 
standard,  so  we  can  hope  that  neuromime  nets  can  be  constructed  to  measure 
both  kinds  of  distances. 

Besides  the  foregoing  investigation  of  the  properties  of  neuromimes 
and  nets  which  can  be  simulated  by  currently  available  programs,  there  are 
extensions  that  should  be  made  in  the  theoretical  and  conceptual  structure 
to  give  the  study  more  direction. 

One  such  extension  is  the  reformulation  of  the  neuromime  equations  so 
that  they  make  sense  with  rapidly  time-varying  inputs.  It  would  be  useful 
if  the  time -varying  formulation  reduced  to  the  constant  formulation  when  the 
inputs  were  constant. 
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a.  Straight  Line 


b.  Some  Points  Missing 

(deficient  in  attributes) 


c.  Points  off  Line  d.  Both  deficiencies 

(Deficient  in  Geometry) 


Figure  7.3.  Nearly  Straight  Line  Stimuli 
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A  possible  approach  to  this  problem  is  linear  prediction  as  described  by 
Wiener  (1949  )  and  extended  by  Allais  (19G4).  Assume  we  have  a  random,  function 
f(t),  and  we  wish  to  predict  the  value  of  f(tQ  +t)  for  some  future  increment 
t  ,  where  t  is  the  current  time.  We  are  free  to  use  any  information  from 
to  t  to  determine  f(tQ  +  T).  'Wiener,  therefore,  defines  a  Kernel 
function  K(t,t),  and  a  predictor 


P(t  ,  t) 


t)  f(t)dt 


(7.1) 


The  goodness  of  the  predictor  is  given  by  the  mean  square  error 


e(tQ,  t)  =  E{[P(tQ,  t)  -  f(tQ  +  t)J2} 


(7.2) 


According  to  Allais,  there  is  no  reason  why  we  must  confine  ourselves 
to  predicting  the  future  values  of  f(t).  This  prediction  theory  applies 
equally  well  to  any  function  h(f,  t)  which  is  correlated  with  f;  for  example, 
f  could  be  an  electrocardiogram,  and  h  could  presumably  be  a  function  which 
is  zero  before  the  QRS  complex  and  one  after  it,  or,  more  interesting,  a 
function,  which  is  one  when  the  QRS  complex  passes  a  certain  point  in  time, 
and  zero  otherwise. 

It  would  thus  seem  that  if  the  g-sets  were  defined  as  predictor  functions, 
so  that  g(t)  is  a  kernel  function  defined  on  the  interval  ( -°° ,  tQ),  and  the 
output  is  computed  by 


R(t  ) 
0 


,V 


g(t)  •  x(t)dt 


(7.3) 


2n 


=  l 


0 


i=l 


g^t) 


x^(t)dt 
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we  could  treat  the  time-varying  case.  The  problem  of  adapting  the  g-weights 
could  be  handled  in  some  such  way  as  setting 

g(t)  =  ( ag ( t )  +  (1  -  a)  x(t ) )R(tQ )  (7.4) 

for  (0  <  a  <  1)  ,  but  this  is  a  problem  which  merits  some  thought. 

Note  that  the  computation  of  R  is  similar  to  the  constant  case  as  in 
(2,2).  Component  interaction  could  perhaps  be  defined  in  the  same  way  as 
in  (2,2)  since  the  feedback  characteristics  of  a  net  are  independent  of  the 
input  functions  (See  Appendix  V), 

A  beginning  on  a  theory  of  organization  was  made  during  this  work.  It 

employs  a  value  system — a  hierarchy  of  goals  which  the  system  is  to  pursue. 

A  sample  system  is  given  in  Figure  7.4.  Some  of  the  goals  contribute  to 

others,  and  some  are  independent  of  each  other. 

A  single  goal-seeking  unit  is  a  mechanism  which  monitors  a  series  of 

inputs  to  determine  the  state  of  the  world,  performs  some  computations,  and 

issues  a  set  of  instructions  to  the  set  of  mechanisms  available  to  it  to 

change  the  state  of  the  world.  (The  world  can  include  the  system  itself. ) 

A  schematic  diagram  of  a  box  appears  in  Figure  7.5. 

The  key  concept  here  is  that  the  world  can  be  thought  of  as  a  space  of 

states  (perhaps  a  vector  space).  It  is  well  known  that  a  representation  of 

a  vector  in  the  form  (a,,  a  ,  ...  .  a  )  means  a  a  t  a  a  t  a,a  t  •••  t  a  a  , 

1*2  n  11  22  33  n  n* 

where  (a  ,...,a  )  are  a  set  of  reference  vectors,  usually  an  orthonormal 
in 

basis.  We  can  consider  the  sensory  input  vector  I  to  be  a  representation 

12  n 

of  the  world  in  the  set  of  reference  vectors  (aT,  cu ,. . .  ,aT ) ,  where  each 
k  ill 

a  can  be  considered  to  refer  to  that  portion  of  the  world  "seen"  by  the 
kth  sensory  input  mechanism.  The  reference  vectors  (aT>  are  not  necessarily 
linearly  independent.  Similarly,  the  set  C  of  commands  to  the  system’s 
effector  devices  is  a  vector,  represented  in  a  set  of  reference  vectors 
(a^,...,a^),  where  each  ct£  represents  the  change  in  the  world  produced  by 
a  unit  instruction  to  a  particular  effector  device. 

#  -4- 

The  goal  in  the  goal -seeking  unit  is  represented  by  a  vector  G  which 
is  the  desired  state  of  the  external  world  as  represented  by  the  input 
mechanism.  The  instruction  vector  is  computed  from  the  deviation  between 
the  actual  inputs  and  the  desired  inputs. 
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Figure  7.4,  Sample  Goal  Lattice 
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,ure  7.5.  Goal-seeking  Unit  with  its  Connection  to  the  World 
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(7.5) 


D  =  I  -  G. 

The  desired  action  is  to  alter  the  world  so  that  D  will  become  0, 
so  that  the  set  of  instructions  to  the  effector  set  will  represent  a 
change  of  state  of  the  external  world  equal  to  the  deviation  of  the  input 
vector  I  from  the  goal.  This  can  be  done  by  considering  an  m  *  n 
transform  T  mapping  {a^}  into  {ap,  so  that 

£  =  -TD.  (7.6) 


T  is  not  necessarily  a  linear  transform. 

Competition  between  various  goals  is  introduced  when  contradi ctory 
instructions  are  given  to  the  effector  set.  To  deal  with  this,  it  is 
necessary  to  introduce  a  system  of  priorities.  This  is  done  by  assigning 
a  unit  value  V  to  the  primary  goal.  This  value  is  distributed  by  the 
primary  goal  to  its  subgoals,  which  in  turn  distribute  value  further  down 
the  lattice.  Sometimes  a  goal  will  receive  value  from  two  or  more  super¬ 
goals,  in  which  case  the  values  can  be  considered  to  add  together. 

Not  all  of  the  value  is  transmitted,  however.  If  a  goal-seeking  unit 
has  a  capacity  to  give  instructions  to  the  effector  set  (some  may 
not — acting  as  logical  goals  and  leaving  all  action  to  subgcals),  some  of 
its  value  must  be  assigned  to  its  commands.  This  is  represented  by 
in  Figure  7.5.  (Transmitted  value  is  represented  by  V  . ) 

A  possible  distribution  system  is  to  construct  a  vector  U  where 


U. 

l 


v.c.w. . 

C  i  i 


(7.7) 


W  is  a  weighting  vector  which  normalizes  the  utility  of  each  individual 
-> 

instruction.  W  is  assigned  so  that 


n 

l  U  <  V  (7.8) 

i=l  L 

for  all  C  which  can  occur  in  practice.  This  is  to  allow  for  assignment 
of  large  value  to  emergency  goals  which  are  allowed  to  override  normal 
goals  when  need  arises.  Usually  the  U  vector  will  be  small  enough  so 
that  other  instructions  will  take  precedence. 


70 


Precedence  is  decided  by  the  effector  mechanism  concerned  by  some  rule 
according  to  the  nature  of  its  action.  If  it  can  execute  only  one  instruction 
at  a  time,  it  will  execute  the  one  with  greatest  value.  If  it  is  a  distribu¬ 
tion  mechanism  it  can  fill  orders  in  order  of  value,  in  proportion  to  the 
product  of  need  and  value,  or  in  some  other  manner. 

This  sort  of  organization  structure  would  be  the  organization  of  a  "bug", 
an  organism  completely  pre-programmed  to  behave  in  a  complicated  way. 

Neuromime  nets  are  adaptive  mechanisms,  so  it  is  reasonable  to  expect  that 
devices  could  be  made  with  some  of  the  following  abilities. 

1.  For  a  goal-seeking  unit  to  construct  its  own  T. 

2.  For  the  system  to  distribute  value  throughout  itself  so  as  to 
optimize  its  behavior  in  seme  way. 

3.  For  a  unit  to  seek  new  inputs  if  it  cannot  differentiate  between 
signals  that  it  must  differentiate,  or  throw  away  inputs  if  it  does  not  need 
them. 

4.  If  two  units  find  themselves  in  competition  to  mutual  disadvantage, 
for  them  to  form  a  coalition  by  generating  a  new  super  box  which  can  regulate 
them  in  some  way. 

5.  For  the  system  to  be  able  to  modify  its  goaJL-seeking  units,  even  to 
the  extent  of  adding  or  deleting  entire  units. 

Possible  mechanisms  for  such  adaptation  might  oe  gained  by  looking  at 
some  natural  mechanisms.  One  such  mechanism,  homeostasis, has  been  considered 
in  a  primitive  way  by  Ashby  [11].  This  is  the  tendency  for  a  system  to 
attempt  to  return  to  an  equilibrium  state  after  being  perturbed.  It  is  the 
mechanism  behind  the  computations  of  a  goal-seeking  unit  as  described 
previously. 

Another  mechanism  which  might  be  applicable  is  that  possibly  used  by 
nature  in  evolution.  The  parameters  of  the  system  are  changed  at  random 
and  the  resulting  system  evaluated  for  goodness  of  performance  in  some  way. 

If  it  is  better  than  the  old,  it  replaces  the  old,  being  otherwise  discarded. 

A  conceptual  structure  which  might  cast  some  light  on  the  organization 
of  such  a  system  without  a  homunculus  is  Wiener’s  []2]  concept  of  the  brain 
as  a  set  of  coupled  nonlinear  oscillators.  This  mathematical  formalism  might 
also  be  relevant  to  the  design  of  neuromimes  with  time-varying  inputs. 
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APPENDIX  I 


SOME  PROPERTIES 

OF  THE  n-DIMENSIONAL  OCTAHEDRON 


The  development  will  be  largely  intuitive  and  what  arguments  are 
given  will  be  largely  combinatorial  in  nature.  For  those  unfamiliar  with 
n-dimensional  geometry,  a  good  mathematical  treatment  is  given  in 
An  Introduction  to  the  Geometry  of  N-Dimensions  by  D.  M.  Y.  Sommerfield, 
Dover,  1958. 

The  Fourth  Dimension  Simply  Explained  edited  by  Henry  P.  Manning, 
Dover,  1960,  presents  several  intuitive  ways  of  imagining  higher 
dimensional  spaces. 
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The  n-dimensional  octahedron  may  be  defined  to  be  that  figure 

bounding  the  region  of  space  determined  by  n  mutually  perpendicular 

coordinate  axes  x,  ,  ....  x  fulfilling  the  condition 

I  n 

|xj  +  +  lxJ  -  1. 

The  boundary  is  an  (n-l)-dimensional  surface  determined  by  the  equation 

|xj  +  •••  +  |xj  =  1. 

This  concept  may  be  developed  intuitively  by  considering  the  cases 
where  n  =  2  and  n  =  3.  In  the  former,  the  boundary  is  the 
1-dimensional  surface 


=  1, 


in  the  latter  the  boundary  is  the  2-dimensional  surface 


I 


(see  Figure  1). 

If  we  restrict  the  range  of  the  so  that  0  *  *  1,  we 

obtain  the  principal  section,  whose  boundary  is  given  by  the  coordinate 
planes  and  the  (n-l)-dimensional  hyperplane  determined  ny  the  boundary 
equation,  which  becomes 


Xx  +  x2  +  •••  +  xn  =  1. 

X 

This  hyperplane  is  defined  as  an  nth  order  face  of  the  octahedron.  There 
are  clearly  2n  nth  order  faces,  since  it  is  possible  to  divide  the 
interior  into  2n  sections  by  restricting  some  of  the  x^  to  the  range 
-1  <  <  0  and  the  others  to  the  range  0  <  x^  5  1,  (£ach  x^  can  be 

restricted  to  either  of  two  ranges,  and  there  are  n  of  the  x^'s.) 
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Two-dimensional  Octahedron 


1 


Three-dimensional  Octahedron 

1 


-1 


Figure  1 
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In  each  section  a  set  of  vertices ,  (v^,  •••>  vn^»  defined  by 

the  intersection  of  the  face  with  the  coordinate  axes.  These  vertices 
define  a  region  in  the  (n-l)-dimensional  hyperplane,  and  this  region  is 
an  (n-l)-dimensional  simplex  with  its  interior.  This  corresponds  to  a 
triangle  in  2-dimensional  space  and  a  tetrahedron  in  3-space.  The  edges 
of  the  simplex  are  the  set  of  lines  connecting  the  vertices  taken  two 
at  a  time.  These  edges  are  all  of  equal  length,  since  each  one  forms 
the  hypotenuse  of  a  right  triangle  whose  sides  equal  1,  so  that  the 
simplex  is  a  regular  simplex.  This  corresponds  to  an  equilateral 
triangle  in  2-space  and  a  regular  tetrahedron  in  3-space.  (See  Figure  2.) 

The  regular  simplex  has  the  property  that  if  any  r  of  its  vertices 
are  taken  away,  the  remaining  n  -  r  vertices  define  an 
(n-r-l)-dimensional  simplex.  This  corresponds  to  taking  the  intersection 
of  the  nth  order  face  with  the  (n-r-l)-dimensional  hyperplane  containing 
n  -  r  of  the  n  coordinate  axes.  For  n  =  4  and  r  =  1  this 
corresponds  to  the  intersection  of  the  hyperplane  containing,  say,  the 
Xj ,  x^ ,  and  x^  coordinate  axes  with  the  tetrahedron  (v^ ,  v^ ,  v^,  v^) 
of  Figure  2,  resulting  in  the  equilateral  triangle  (v^ ,  v^ ,  v^). 

We  will  call  the  intersection  with  the  nth  order  face  of  a 
(k-l)-dimensional  hyperplane  containing  k  vertices  in  the  principal 
section  a  kth  order  face.  This  is  a  (k-l)-dimensional  simplex  containing 
k  of  the  n  vertices. 

Since  a  kth  order  face  is  a  slice,  as  it  were,  of  the  nth  order  face, 
the  points  of  both  its  boundary  and  interior  satisfy  the  boundary 
equation  (see  Figure  2). 

It  is  easy  to  count  the  number  of  kth  order  faces  in  an  n-dimensional 
octahedron.  A  single  kth  order  face  is  a  simplex  whose  vertices  are  k 
of  the  n  vertices  of  an  nth  order  face.  The  number  of  combinations 
of  n  things  taken  k  at  a  time  is 

n|_  n! _ 

k |  k! (n  -  k) ! 

There  are  2n  nth  order  faces,  and  each  k ^  order  face  is  in  co,:  ion  to 
2n  K  sections,  making  fn)  K 

U  I2 
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kth  order  faces  in  the  entire  octahedron,  (A  first-order  face  is  defined 
to  be  a  vertex,  and  a  second-order  face  is  an  edge,) 

It  should  be  noted  that  a  kth  order  face  is  a  face  of  a  k-dimensional 
octahedron  as  well. 

We  are  going  to  want  to  divide  the  principal  section  into  regions. 

To  do  this,  it  is  useful  to  consider  the  problem  in  the  nyperplane  of 
the  face.  We  can  orient  ourselves  in  this  hyperplane  by  considering  the 
location  of  various  solutions  to  the  boundary  equation. 

Consider  the  case  with  n  =  4.  The  boundary  equation  is 


x  +  x  +  x  +  x  =1 
12  3  4 


and  this  defines  the  interior  and  boundary  of  a  regular  tetrahedron,  as 
in  Figures  2  and  3.  Figure  3  is  a  map  of  this  tetrahedron.  The  points 
are  given  as  quadruplets,  having  the  values  of  the  coordinates 
x  ,  x  ,  x  ,  x  in  order. 

The  partitioning  to  be  considered  in  detail  is  that  defined  by  the 

inequalities  x^  >  x^.  In  other  words,  we  want  to  take  every  point  whose 

x.  coordinate  is  greater  than  its  x.  coordinate,  the  other  coordinates 
1  i 

unconstrained,  put  them  into  a  regicn,  and  put  those  points  not  fulfilling 
that  condition  into  another.  The  boundary  between  the  two  regions  is  the 
hyperplane  x^  =  x ^ .  This  is  shown  in  the  2-dimensional  and  3-dimensional 
case  in  Figure  4,  and  in  the  4-dimensional  case  in  Figure  5. 

We  note  that  the  set  of  hyperplanes  x^  =  x  separate  every  kth 
order  face  contained  in  the  nth  order  face  bounding  the  principal 
section.  This  is  true  since  two  kth  order  faces  are  determined  by 
the  equations : 


x . 
i 


1 


+  •  •  •  +  X.  =  1 
1, 


X. 

1 


0  for  i  t  {i  , 


.  • .  , 


+  *  •  •  +  x  .  =  1 


Xj  =  0  for  j  i  {j1 


77 


V 


Two-dimensional  Simplex  (Equilateral  Triangle) 


Three-dimensional  Simplex  (Regular  Tetrahedron) 


Figure  2 


^l.O.O.O) 


A  -  center  of  triangle  VjV^^ 

B  -  center  of  triangle  v2v3vi, 

C  -  center  of  tetrahedron 
D  -  I  -  center  of  edges  as  indicated 


Figure  3.  Map  of  Tetrahedron 


,0,0) 
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(0,1) 


Two-dimensional  Case — Plane  Divided 


Three-dimensional  Case — Showing 

Six  Regions  in  Plane  of  Face,  Above 
Below,  Stage  in  Construction 
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(1, 0,0,0) 


figure  5 .  Four-dimensional  Region  Seen  in  Hyperplane 
of  a  Face,  Showing  Intersection  of 
Four-dimensional  Region  with  Face 
in  Tetrahedron  Vj  ABC 
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It  is  also  the  case  that  all  the  hyperplanes  which  intersect  a  given 

kth  order  face  intersect  in  a  point  which  is  the  center  of  the  face.  This 

can  be  seen  by  considering  the  particular  kth  order  face  determined  by 

the  vertices  (v.  ,  . ..,  v.  ).  The  hyperplanes  which  intersect  this  face 
X1  1k 

are  given  by  the  equations 


x. 

1 


1 


•  •  •  9 


X. 

1 


2 


x . 
l 


=  X. 


k-1 


all  of  which  are  satisfied  by  the  point 


x.  =  x .  =  • 0  •  =  x. 

ii  l. 

1  2  k 


1 

k* 


This  point  is  the  center  of  the  kth  order  face. 

We  can  now  see  that  a  region  defined  by  this  set  of  hyperplan^s  will 

be  defined  as  an  (n-l)-dimensional  i*egion  determined  by  a  series  of  vertex 

points  which  can  be  described  as  a  sequence  of  midpoints  of  successive 

kth  order  faces,  k  going  from  1  to  n.  This  figure  is  a  simplex  in 

n-1  dimensions,  as  can  be  seen  for  the  4-dimensional  case  in  Figure  5. 

The  region  is  a  tetrahedron,  and  the  vertex  points  are  v^ — a  1st  order 

face,  A — the  midpoint  of  the  2nd  order  face  (v^ ,  v^),  B — the  midpoint 

of  the  3rd  order  face  (v  ,  v  ,  v  ) ,  and  C — the  midpoint  of  the  4th 

14  3 

order  face . 

With  this  in  mind,  we  can  count  the  number  of  regions  determined  in 
an  n-dimensional  octahedron  by  the  set  of  hyperplanes  {x^  =  x^}.  If  we 
examine  the  sequence  of  vertices  defining  the  region  in  a  combinatorial 
sense,  we  see  that  we  can  choose  first  any  vertex  v^,.  Next,  we  must 
have  a  midpoint  of  an  edge,  with  the  constraint  that  the  edge  include 
the  vertex  v^  chosen  previously.  In  the  same  way  the  kth  vertex  point 
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of  the  region  is  the  midpoint  of  a  kth  order  face  which  must  contain  the 

(k-l)st  order  face  whose  midpoint  was  chosen  for  the  (k-l)st  vertex  point. 

Denoting  the  n  vertices  of  the  region  by  r  ,  r^,  we  see 

that  we  have  n  choices  for  r^  ,  n  -  1  choices  for  r^ ,  . ..,  and  only 

1  choice  for  r  .  There  are  thus 
n 

n*(n  -  l)*(n  -  2)*(n  -  3)  • 0  *  (1)  =  n! 


regions . 

The  problem  of  partitioning  the  principal  section  into  regions  can 
be  treated  more  generally  by  considering  a  set  H  of  hyp^rpl in  n 
dimensions  defined  by  the  m  equations 


a 


a 


0 


0 


+ 


+ 


a!*i 


m 

aixi 


t 


+ 


•  •  •  +  a*x 
n  n 


m 

•  •  •  +  ax 
n  n 


0 

0 


with  the  following  restrictions 

1.  at  least  one  a*  =  0  i  =  1,  . ...  n 

2.  the  ordered  set  (a^  k  =  1,  2,  n)  is  a  permutation  of  tiie 

ordered  set  (al  i  =  1,  2,  . ..,  n)  for  k  =  1,  2,  . m 

3.  m  -  the  number  of  distinct  permutations  of  (a!  i  =  1,  2,  n) 

4.  the  line  =  x^  =  0  0  •  =  x^  is  contained  in  every  hyperplane. 

In  the  hyperplane  of  the  nth  order  face  F  H  becomes  a  set  Hf 
of  hyperplanes  in  n  -  1  dimensions  intersecting  in  the  point 


x 


1 


X 


2 


X  = 

n 


1 

n  * 


the  center  C  of  the  nth  order  face,  H f  defines  a  set  of  simplexes 
in  F  all  of  which  have  C  as  a  vertex.  By  symmetry,  all  of  these 
simplexes  are  congruent,  and  the  (n-l)st  order  solid  angle  at  C  is 
equal  for  each  of  them.  The  sum  of  these  angles  is  the  complete  (n-l)st 
order  solid  angle  at  C,  so  that  the  number  P  of  regions  is  equal 
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to  the  complete  (n-l)st  order  solid  angle  Sn  at  C  divided  by  the 
(n-l)st  order  solid  angle  determined  at  C  by  a  single 

(n-1) -dimensional  simplex  formed  from  n  of  the  hyperplanes  in  H'. 

In  symbols. 


P  = 


n-1 

'n-1 


is  the  surface  content  of  the  unit  hypersphere  in  n-1 


dimensions. 


n-1 


2ir 

t 


(k 


for  (n-1)  even  (=  2k ) 


2irkk»  22k 

(2k  )! 


for  (n-1)  odd  (=  2k  +  1) . 


may  be  determined  by  considering  the  simplex  cut  out  in  the 
hypersphere  by  the  n  (n-1) -dimensional  hyperplanes  forming  the  given 
(n-l)st  order  solid  angle.  This  simplex  may  be  considered  as  a  simplex  in 
an  (n-2) -dimensional  spherical  space,  and  the  problem  is  reduced  to  finding 
the  volume  of  this  simplex. 

One  way  that  this  can  be  done  is  to  transform  the  equations  from  the 
n-dimensional  cartesian  coordinate  system  (xl,...,x  }  to  an  n-dimensional 
cylindrical  system  (z,  r,  ,  with  the  z-axis  on  the  line 

Xj  =  x2  =  ***  =  xn  •  The  resulting  expressions  with  z  =  1/*5T  ,  r  =  1  will 
give  the  intersection  of  the  set  of  hyperplanes  with  the  unit  hypersphere 
circumscribing  the  nth  order  face.  A  simplex  may  be  identified  and  its 
volume  computed  by  techniques  peculiar  to  the  particular  set  {a^}  chosen 
to  define  the  set  H  of  hyperplanes . 
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APPENDIX  II 


EXAMPLE  OF  USE  OF  G-SET  ADAPTATION  EQUATIONS 


We  will  demonstrate  the  usefulness  of  the  vector  formulations  of  the 
g-set  adaptation  equations  by  solving  the  problem  raised  in  Section  IV 
concerning  the  determination  of  the  configuration  of  the  g-set  when  the 
output  becomes  zero  during  negative  adaptation,  using  the  adaptation 
equation  developed  in  (5.26). 

Specifically ,  we  wish  to  find  g(t)  such  that 

x  •  g(t)  =  0  (1) 

From  (5.26),  we  can  expand  g(t),  so  that  (1)  becomes 


5  .  |(o)  ♦  M2)  (,Wt  -  .1)  ;  •  3  .  0 


(2) 


But,  from  (4.21), 

x  «  d  =  <5 

and  from  (5.47),  with  B  =  0  , 

R(0) 


x  •  g(0)  = 


M, 


(3) 


(4) 


so  that  (2)  becomes 


R(0)  tIMI6t 

157“  6  =  ° 


(5) 


Since  Fj<0,  this  will  occur  when  t  =  «,  therefore 


F  M  6t 
e  =0 


(6) 


Substituting  (6)  into  the  expression  (5.26)  for  g(t),  we  have  the 
solution: 


8<t>  *  1(0)  -  SM  J  . 

In  the  numerical  example  from  Table  4.2,  we  have 


(7) 


g(0)  = 


.5  ' 

1  -.1 

.5 

-► 

.1 

d  = 

-.5 

.2 

\-.5  J 

-2  / 

(8) 
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Mj  =  1,  6  =  .1,  R(0)  =  .1, 

so  that 

R(0)  


(9) 


and 


g(t)  = 


.6 
.4 
-.7 
-.3 


This  is  the  correct  answer,  since 


x  •  g(t)  =  .24  +  .24  -  .42  -  .6  =  0 


(10) 


(11) 
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APPENDIX  III 


SOLUTION  TO  THE  SYSTEM  OF  DIFFERENTIAL 
EQUATIONS  y»  =  u  v1  y  +  bu 

The  system  of  n  simultaneous  differential  equations 


yf  =  uvy  +  u  v  y  +  •«*  +  u  v  y  +bu 
y  \  1  ly  1  1  2y  2  1  nyn  1 


y  *  =  uvy  +  u  v  y  +  •  *  *  +  u  v  v  +  bu 

y  2  2  1J 1  2  2y  2  2  n"  n  2 


y*  s  uvy  +  uvy  + 

yn  n  \y  l  n  2  2 


+  u  v  y  +  b  u 
n  n^n  in 


can  be  conveniently  represented  in  the  form 

4  -*  -+ 

y !  =  A  y  +  b  u 


where 


y' 


/;A 


y  = 


y>  \ 

y2  1 


Vn  / 


(1) 


and 


A  = 


/  Vi 

u  V 

2  1 


u  v 

1  2 


u  v  1 

1  n 


u  v  •  •  •  u  v 
2  2  2  n 


uv  uv  °  •  uv  ] 

^  n  1  n  2  n  n  / 


**■  /^*  ■►t 


!  U 

v  \ 

|  2 

2 

• 

• 

-K 

u  = 

* 

v  = 

• 

1  • 

. 

« 

n  / 


v./ 
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v*  is  the  transpose  of  v  ,  i.e., 

?t  *  <V  v2  -•  vn> 


and  the  operation  0  is  the  matrix  product.  The  symbol  0  will  be 
emitted  whenever  possible . 

The  solution  of  this  system  is  given  by 


y  =  y 


homogeneous  +  ^particular 


(2) 


where  y,  is  the  solution  to  the  system 

^homogeneous 


yf  =  Ay 


(3) 


and  y  ^ .  is  an  additional  particular  solution  to  the  non- 

^ particular 

homogeneous  system. 

For  the  homogeneous  system,  (3),  we  assume  a  solution  of  the  form 


ke 


Xx 


(4) 


where  k  is  a  vector  of  constants.  Substituting  into  (3),  we  obtain 


Xx  +  Xx 

Xke  =  Ake 


or 


(A  -  XI)  k  =  0  (5) 

To  obtain  a  solution  other  than  k  =  0,  the  determinant 

|  A  -  XI  |  =  0  (6) 


It  should  be  noticed  that  (6)  is  the  characteristic  equation  for  A, 
having  as  solutions 


X  = 


X 

n 
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for  X  ,X  ,  ...,X  the  characteristic  values  of  A.  Associated  with  each 
1  2  n  +  + 

X.  is  a  vector  of  constants  k..  Each  k.  may  be  determined  up  to  a  single 
i  it 

arbitrary  constant  c^  by  substituting  into  (5)  and  solving  the  system 


(A  -  X.I)  =  0 


(7) 


In  order  for  (7)  to  hold,  k^  must  be  a  constant  times  the  characteristic 
vector  associated  with  X^.  If  cu  is  this  characteristic  vector. 


k.  =  c.a. 
l  ii 


(8) 


For  all  X^  distinct,  (8)  yields  the  complete  solution  for  the  system 
(3)  by  the  superposition  principle: 


-*■  X.x 

c  a  e  *  + 

1  1 


+  c  a  e 
n  n 


Xnx 


If  the  X.  are  not  distinct,  but 


X. 

i 


1 


X. 

1. 

1 


X 


0 


* 


then  instead  of  (4)  we  must  assume  a  solution  of  the  form 


/"t  "t  i*  ( j-lK  x  X 

y  =  (ki  +  k2x  +■>»•+  k^x  J  )  e  o 


(9) 


A 


In  (3),  the  case  at  hand,  we  observe,  following  Wilf  (1962),  ch .  1,  that 

/ - N. 

u  {y  v  is  of  rank  1,  since  for  any  vector  z 


Az  =  [u  0  v1]  0  z  =  u  O  tv1  O  ^  =  ( v*z )  u,  (10) 


■►t  /"*\  -*■  -y  -y 

V  Z  =  V  Z  +  •••  +  V  Z  =  V*  ; 

11  n  n 


(11) 


(v  •  z  is  the  inner  or  dot  product  of  the  vectors  v  and  z.)  (6),  therefore, 

must  reduce  to 
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0 


(12) 


An  “  1(X  -  Afl) 


having  as  solutions 


X,  =  Xn  ,  Xo  =  X,  =  •••  =  X  =  o. 
1  0  2  3  n 


(13) 


may  be  evaluated  by  recalling  that  the  trace  of  a  matrix  is 
equal  to  the  sum  of  its  characteristic  values,  (see  Wilf  (1962)  In  symbols 


tr(A)  =  uv  +uv  +  • • •  +  u  v  =  X  +A  +  • • •  +  X 
112  2  n  n  12  n 


(14) 


Substituting  (13)  into  (14)  and  noting  (11),  we  have 


tr(A)  =  u  •  v  =  X 


(15) 


The  characteristic  vector  a  associated  with  X  may  be  obtained 

0  0 

by  observing  that 


A  '  V  u 


'  -*t-,  -> 

[u  v  ]  { 9J  u 


(u  •  v )u  =  XQu 


We  have,  therefore,  that 


a  =  u 


and  one  solution  to  (3)  is  given  by 


+  ( u  * v  )x  X  x 

y  =  cue  =  cue  o 


(16) 


The  repeated  characteristic  value  0  causes  us  to  look  for  the 
remaining  solution  of  the  form  (9)  as 


■+  j  ->■  -v  ♦  (n-?V 

_  |  ^  +  k  x  r  k  *  +  * #  *  (-  k  xv  '  1  < 

y  1  2  ' 3  '  n-1 


(17) 


The  constants  ,  . 

(17)  into 


.  ,  ^  can  be  evaluated  by  substituting 
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y '  -  Ay  =  0 


(18) 


Differentiating  (17),  we  obtain 


y'  =  Ik  +  2k  x  +  • • •  +  (n-2)k  x^n 
2  3  n  - 1 


(19) 


Ay  =  u  •  v*  •  y  = ;  u  (v*k  )  +  (v*k  )x  +  •••  +  (v»k  )x^n  “ '  |(20) 

j  -  j  i  j  2  n-l 


Subtracting  (19)  from  (20)  and  gathering  terms. 


+ 


Ay  -  y*  =  [oM^u  -  f(v*k2  )u  -  21<3jX 

•••  +  £(v  •  kn_2)u  -  (n-2)kn_i^]xn_3  +  f^(v  •  k^JuJx0-2 


(21) 

=  0 


o  n-l  .  . 

Since  the  functions  1,  x,  x  ,  ...  ,  x  are  linearly  independent 
(see  [2]),  we  have  the  following  n-2  equations  by  setting  equal  to  zero 
each  coefficient  of  j  =  0,  1,  ...  ,  n-3 


-> 

k 

2 


(v 


)u 


-> 

k 


3 


)u 


n-l 


n-2 


"( v 


k  )u 

n-2 


(22) 


-V 

These  equations  are  all  dependent  on  k  ,  since  they  form  a  nested 
recursion  relation.  Solving  them  explicitly,  they  become: 


yi 


(23) 


-+ 

k 

2 


(u  •  k^u 

L  [v  •  (v  •  k^ )u]u  =  L  (v  •  u)(v  •  k^ )u 


iX  (v  •  k, )u 
2  0  1 


k 


n-l 


1 

(n-2  ) 


n-3  +  £  *-► 

x0  (v  '  ki)u 


(n-2  ) 

From  the  coefficient  of  xv  in  (21),  we  have  the  additional 

information  that 


(v  •  k  )u  =  0 

n-i 


(24) 


i  o  by  hypothesis,  so  that  we  must  have 


n-l 


.n-3 


(n-2 ) ! 


(v 


kjKi 


.n-2 


(n-2) ! 


(v 


V 


(25) 

0 


By  the  same  argument  as  in  (24),  we  must  have 


(26) 


Substituting  (26)  into  (23),  we  obtain 


K  =  k 
2  3 


=  k  =  0 

n-l 


(17)  therefore  reduces  to 


y  =  K 


and  substituting  into  (18),  we  have 
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0 


-Ak_ 


(27) 


If  we  define  k  =  k,  (27)  may  be  solved 


Ak  =  (u  ©  v*)  (3  k  =  (k  •  v)u  =  0 


u  t  0  ,  so  that 


or 


k  •  v  =  0 

k 


n 


vk  +  •  •  •  +  v  ,k 
1  1  n-1  n-1 


-v 


(28) 


leaving  n-1  arbitrary  constants . 

We  now  have  the  complete  solution  of  (3)  as 


y  =  cue  +  k,  for  X  =  (u  •  v) 


(29) 


The  particular  solution  for  the  non -homogeneous  system  (1) 
obtained  by  considering  the  system 


is 


y '  -  Ay  =  bu 


(30) 


taking 


-+• 

y  =  Pbu , 


(31) 


for  P  an  n  *  n  matrix,  and  substituting  in  (30),  we  have 


APbu  =  bu 


-  u  ©  ^  ©  pQu 


-+ 

U 


(32) 
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We  observe  that  we  can  obtain  a  solution  by  taking 


P  = 


•(u  vt) 

X1 


(33) 


(32)  becomes 


•<:  e^)©  ^  Q,  ^  ©  n  - 


-*■  (v 

u  - 


-*•  -y 
u)  (v 
TT 


-*■ 

u) 


-¥ 

u 


(34 ) 


Therefore 


■y 

y 


b(u  O  vr) 
- 1* - U 


bu 


(35 ) 


is  a  particular  solution  of  (1). 

Substituting  (29)  and  (35)  into  (2),  we  have  the  complete  solution 


-* 

y 

^homogeneous 


+  Xt 


cue 


+  k 


-> 

y 

^particular 


-b 

X 


u 


(36  ) 


and 


y 


•+■  Xt 
cue 


-y 

+  k 


(37  ) 
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APPENDIX  IV 


DERIVATION  OF  SURFACE  ELEMENT 
FOR  A  HYPERSPHERE 


The  surface  content  of  an  n-dimensional  hypersphere  is  a  volume  in 
n-1  dimensions.  According  to  Somerville  (1958),  this  volume  is 


S 

n 


n 


2tt 


n-1 

r 


(1) 


where  r  is  the  radius  of  the  hypersphere  and  T  is  the  gamma  function, 
the  generalized  factorial.  Note  that 


r(x)  =  (x  -  i)  r (x  -  i) 


To  find  the  equation  for  a  volume  element,  we  must  consider  the 
geometry  of  the  space.  An  (n-1 )-dimensional  hypersphere  has  the  property 
that  if  a  point  be  chosen  on  it  as  an  origin,  and  a  sequence  of  n-1 
orthogonal  great  circles  drawn,  these  circles  will  intersect  again  at  a 
point  diametrically  opposite  the  origin,  the  line  connecting  the  two  points 
passing  through  the  center  of  the  hypersphere.  These  two  points  will  be 
termed  respectively,  the  North  and  South  poles  of  the  hypersphere. 

We  will  use  a  spherical  coordinate  system  for  computing  the  volume, 
with  n  coordinates:  a  radius  p  and  n-1  angles,  4^,  •  •  •  • 

As  in  3-dimensional  spherical  coordinates,  the  angles  are  of  different 

kinds.  In  3  dimensions,  4>1  is  a  conical  angle  from  the  North  pole  arid 
<J>2  is  a  circular  angle  in  a  plane  parallel  to  the  equatorial  plane,  as 
illustrated  in  Figure  1.  If  p  is  fixed,  x  is  constrained  to  lie  on  the 
sphere  of  radius  p  and  with  poles  S  and  N.  If  is  then  fixed, 

x  is  constrained  to  lie  on  the  circle  C,  of  radius  p  sin<j>^,  the  inter¬ 
section  of  the  sphere  determined  by  p  and  the  cone  determined  by  <|>  . 
Fixing  $  then  determines  the  exact  location  of  the  point  on  C.  Angle 
<J)^  is  measured  from  the  line  ON,  and  angle  ^  is  measured  from  the 
plane  determined  by  the  line  ON  and  the  great  circle  -  more  directly 
from  the  point  on  C  given  by  the  intersection  of  C  and  L^.  Note  that 
<J>^,  as  a  conical  angle,  ranges  from  0  to  tt  only,  while  ,  as  a  planar 
angle,  ranges  from  0  to  2tt  . 
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*2> 


Figure  1.  Three-dimensional  Spherical  Coordinate  System 

Figure  1.  3-Dimensional  Spherical 
Coordinate  System 
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This  3- dimensional  system  can  be  extended  to  n-dimensions  in  a 

straight  forward  manner.  Fixing  p  determines  a  hypersphere.  Fixing  <f>^ 

determines  an  n-dimensional  hypercone,  the  locus  of  all  points  x  such 

that  the  line  Ox  is  at  an  angle  ^  with  the  line  ON.  The  intersection 

of  the  two  is  clearly  an  n-1  dimensional  hypersphere  with  radius  psinS^. 

If  we  select  arbitrarily  a  great  circle  on  the  n-dimensional  hypersphere 

the  intersection  of  with  the  (n-l)-dimensional  hypersphere  gives  us  a 

North  pole  for  the  latter  hypersphere. 

We  can  continue  this  process  inductively,  defining  successive  (n‘ -  i  +  1) 

dimensional  hyperconical  angles  which  restrict  x  to  successive  (n  -  i)- 

dimensional  hyperspheres  with  radius  psin<f>^  sin^  •••  sin<{K,  until  we  come 

to  <f>  0,  (which  corresponds  to  the  three-dimensional  angle  <J>  of  Figure  1). 

n-z  »  i 

Angle  ^  is  then  a  planar  angle.  We  define  the  North  pole  of  the  (n  -  i)- 
dimensional  hypersphere  by  its  intersection  with  line  (great  circle)  of 

the  (n  -  i  +  1) -dimensional  hypersphere.  The  successive  surfaces  L^, 

(L  ,  L  ),  (L,  ,  L0#  LQ) ,, , , ,(Ln , , , . ,L  0)  are  coordinate  surfaces  imbedded  in 
the  original  n-dimensional  hypersphere. 

A  surface  content  element  of  the  n-dimensional  hypersphere  is  thus 

dSn  =  (pd«j>1)(psin<>1d<»2)  •••  (psin^  •••  sin^^&J^^) 

and  the  allowed  intervals  are  0  <  <  n  for  the  hyperconical  angles 

2  and  0  <  ^  i  <  2tt  for  the  planar  angle. 
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APPENDIX  V 


CONTROL  SYSTEM  FORMULATION 
OF  NEUROMIME  INTERACTION 


A.  Introduction 

Problems  of  interacting  outputs  in  neuromime  nets  are  of  such  a  nature 
to  lend  themselves  to  analysis  with  control  system  techniques.  This  note 
gives  a  control  system  formulation  of  the  S  input  computations  of 
neuromimes.  Figure  1  may  be  compared  with  Figure  2.5  of  the  text  and  the 
structures  seen  to  be  of  the  same  kind. 
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B.  Description  of  Model 

Figure  1  is  a  block  diagram  of  a  two-element  neuron  net  model. 

Standard  control  engineering  notation  is  employed,  wherein  r's  represent 
input  time  functions  and  c's  represent  output  time  responses.  Each  element 
has  a  forward  gain  of  unity  and  an  output  which  interacts  with  each  other 
element  by  furnishing  an  additional  input  to  each  other  element  of  a  magni¬ 
tude  dictated  by  an  "interaction  coefficient"  (K12  and  K21  in  Figure  1). 

This  form  of  model  can,  of  course,  be  generalized  to  an  n-element  net  with 
n(n-l)  interaction  paths. 

The  responses  (c's)  of  this  model  to  a  set  of  inputs  (r's)  is  controlled 
by  the  values  of  the  interaction  coefficients.  The  analysis  to  follow  will 
investigate  this  relationship  between  the  neuron  output  and  the  degree  and 
form  of  interaction  associated  with  the  other  neuron  outputs.  The  two-element 
model  will  be  employed  throughout  since  it  is  the  least  cumbersome  to  handle 
mathematically  and  produces  conclusions  which  also  hold  for  an  n-element  net. 

C.  Stability  Criteria 

The  system  equations  for  the  two-element  model  of  Figure  1  are 


C1  =  ri  +  K12C2 


°2  =  r2  +K21C1 


In  matrix  notation  this  equation  set  is  expressed  as 


c, 

r°  k,  i 

C, 

r 

1 

- 

12 

1 

+ 

1 

c 

K  ,  0 

c 

1 

r 

2 

21 

2 

2 

or 


(1) 

(2) 


(3) 


(4) 


The  output/input  relationships  obtained  from  simultaneous  solution  of 
Equations  (1)  and  (2)  are 
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forward  gain 


a 

o 

O 


m  o 
U  -H 

o  ^ 

+>  4h 

a  a> 

M  O 
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F~» 

05 

4-« 

O 


txO 

05 

•H 
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O 

O 

H 


a 

to 

•H 
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wo-element  Neuron  Net  Model 


(5) 


1  -  K12K21 


and 


1  +  K21(ri/r2) 

1  -  K12K21 


(6) 


According  to  Clarke  (1062),  Equations  (5)  or  (6)  reveal  that  the  outputs  will 
increase  without  bound  for  any  condition  where  the  denominator,  1  -  Ki2K21* 
is  equal  to  or  less  than  zero.  In  terms  of  the  interaction  coefficients, 
the  stability  constraint  is  that  the  product  of  the  interaction  coefficients 
be  less  than  +1. 

This  stability  criteria  can  be  more  concisely  expressed  by  referring 
to  the  matrix  form  of  the  system  equations.  Equation  (4).  The  same  stability 
constraint  can  be  specified  in  terms  of  the  determinant  of  the  coefficient 
matrix,  i.e.. 


*  0  . 


(7) 


For  an  n-element  neuron  net  model  the  matrix  form  of  the  system  equations 
becomes 


1 

-K12 

~K1  3 

-K21 

1 

-K2  3 

-K31 

-K32 

1 

r 

1 


2 

3 


(8) 
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and  hence  the  stability  criteria  is 


- 1 

M 

CM 

f— t 

54 

1 

1 - 

« 

• 

• 

CO 
«— l 

54 

1 

*K21 

l 

— K  «  •  *  • 
*23 

-K3l 

-K32 

1  •  •  * 

We  emphasize  that  this  stability  criteria  is  in  no  way  dependent  on 
the  inputs;  only  the  signs  and  magnitudes  of  the  interaction  coefficients 
determine  the  stability  of  the  output  responses.  Of  course,  for  any  stable 
set  of  interaction  coefficients  the  output  response  will  be  directly  affected 
by  the  inputs. 

D .  Performance  of  Refined  Model 

In  real  physiological  systems  transmission  times  are  finite.  To 
extend  the  utility  of  our  basic  neuron  net  model  we  are  behooved  to 
incorporate  this  physiological  fact. 

We  have  found  the  Laplace  transform  to  be  a  very  useful  tool.  (1)  It 
enables  us  to  obtain  closed  form  expressions  relating  output  to  input 
for  any  type  of  feedback  system.  (2)  It  affords  easy  handling  of  systems 
which  contain  time  delays. 

Our  initial  efforts  to  incorporate  these  refinements  of  the  neuron 
model  have  produced  some  interesting  results.  For  instance,  for  a  model 
containing  neuron  elements  of  +1  forward  gain  and  positive  interaction 
coefficients  less  than  unity: 

1.  A  set  of  step  function  inputs  results  in  a  set  of  outputs  which 
are  increasing,  convergent  "staircase"  functions; 

2.  The  final  values  of  the  staircase  functions  are  determined  by  the 
magnitudes  of  the  interaction  coefficients  (The  final  value  may  be  ascer¬ 
tained  from  an  equation  of  the  form  of  (5)  of  Section  C); 
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3.  The  time  rates  of  convergence  (time  between  steps  of  staircase) 
are  dependent  on  the  magnitudes  of  the  transmission  time  delays  associated 
with  the  forward  paths  of  the  neurons  and  the  interaction  paths  between 
neurons. 

A  typical  example  illustrating  these  results  for  a  neuron  net  of  two 
elements  is  presented  in  Figure  2. 

When  values  of  +1  are  assigned  to  all  the  interaction  coefficients, 
the  staircase  response  function  increases  without  bound  —  just  as  predicted 
by  the  stability  criteria  formulated  in  Section  C. 

E.  Recommendations  for  Further  Study  and  Refinement  of  Model 

Our  initial  study  of  the  performance  characteristics  of  this  refined 
neuron  model  indicate  that  it  may  be  quite  useful  in  aiding  the  understanding 
of  phenomena  associated  with  neural  networks  such  as  1)  Summation, 

2)  Variability  and  modification  of  response,  and  3)  the  independence  of 
the  rhythms  of  stimulus  and  response. 

Further  study  should  involve: 

1.  Investigating  model  response  for  rectangular  pulse  inputs,  impulse 
inputs,  and  input  sets  which  contain  both  positive  and  negative  inputs; 

2.  Further  study  of  pertinent  physiological  phenomena  to  determine 
what  additional  refinements  might  be  profitably  incorporated. 
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Preconditions 

Inputs:  Positive  unit  step  functions;  forward  gains:  +1  (with  10  millisecond 
transmission  time  delays)  interaction  coefficients:  K12  =  .5,  K  =  .75  (with 
5  millisecord  transmission  time  delays);  total  loop  transmission  time  delay: 

30  milliseconds 
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APPENDIX  VI 


COMPARISON  OF  ADAPTATION  EQUATIONS 
FOR  TWO  NEUROMIME  SIMULATORS 


The  adaptation  equations  of  the  two  neuromime  simulators  are  slightly 
different.  The  7094  simulation  program  developed  by  Service  Bureau 
Corporation1  uses  the  equations  discussed  in  the  body  of  this  report, 

A.g  (t)  =  F  •  R(t)  •  (*  -  x>  (1) 

=  F  •  R(t)  •  d  for  R(t)  >  0 


=  0  for  R(t)  <  0 

The  special  purpose  computer  designed  by  Teledyne2  uses  the  equations: 


^-g(t)  =  F  •  [R]  •  d 


(2) 


where 


=  1  if  R(t )  >  0 
[R] 

=  0  if  R(t )  <  0 

Note  that  both  systems  are  zero  when  the  output  of  the  neurcmimes  is 
zero,  which  means  that  they  will  both  adapt  only  to  a  positive  simulus .  The 
former,  however,  has  a  multiplicative  factor  R(t),  while  the  latter  has  a 
factor  of  1  when  it  is  adapting.  This  means  that  system  (2)  will  adapt  at 
a  constant  rate  independent  of  the  output,  while  system  (1)  adapts  at  a  rate 
which  increases  exponentially  with  time . 


tracer,  Franklin,  and  Orr,  Kenneth,  Neuromime  Network  Simulation, 

Final  Report,  Contract  AF33(657)-11194,  Service  Bureau  Corp., 

14  August  1964. 

2Neural  Network  Simulator,  Final  Engineering  Report,  Contract  AF33(657)-8489 , 
Teledyne  Systems  Co.,  January  1965. 
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A  significant  effect  of  this  difference  is  that  system  (1)  will  have 
greater  differential  adaptation  capability  and  less  adaptation  retention 
than  system  (2).  Solving  for  R(t)  in  (1)  (See  Section  V  of  report),  we 
have 

FM  fit  (3) 

R(t)  =  R(0)e 


where  is  a  constant  and  5  =  | d | 2 

and,  solving  (2),  we  have 

R(t)  =  R(0)  +  Ffit  (4) 

From  (3)  and  (4),  one  can  see  that  if  two  neuromimes  have  inputs  with  the 
same  5  ,  then  the  difference  between  their  outputs  will  be  constant  in 
system  (2),  and  will  increase  with  system  (1)  if  F  >  0.  If  <  0,  the 
difference  will  decrease  in  system  (1.)  and  again  remain  the  same  in  system  (2). 
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